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VECTORS FOR OCULAR TRANSDUCTION AND USE THEREOF FOR GENETIC 

THERAPY 

Work described herein was supported by NIH grants EY1 1431 and 

HL54352. The government has certain rights in such subject matter. 
5 RELATED APPLICATIONS 

This application claims the benefit of priority to U.S. application Serial No. 

09/562,934, filed May 1, 2000, to Glen R. Nemerow, Daniel Von Seggern,; 

Martin Friedlander, entitled "VECTORS FOR OCULAR TRANSDUCTION AND USE 

THEREFOR FOR GENETIC THERAPY". 
10 This application is related to copending U.S. application Serial 

No. 09/482,682 (also filed as International PCT application No. 

PCT/USOO/00265, filed January 14, 2000)), to Daniel Von Seggern, Glen R. 

Nemerow, Paul Hallenbeck, Susan Stevenson, Yelena Skripchenko, filed January 

1.4, 2000, entitled "Adenovirus Vectors, Packaging Cell Lines, Compositions, and 
15 Methods for Preparation and Use," which is a continuation-in-part of U.S. 

Application 09/423,783 filed November 12, 1999 and claims the benefit of the 

filing date of U.S. Provisional Application 60/1 15,920 filed January 14, 1999. 

Where permitted, the contents and subject matter of each application and of the 

provisional application are incorporated in their entirety herein by reference. 
20 FIELD OF INVENTION 

The present invention relates to gene therapy, especially to adenovirus 

vector-based gene therapy. In particular, adenovirus vectors for therapy of 

ocular diseases and methods of treatment using the vectors are provided. 

Compositions, kits, and methods of preparation and use of the vectors for gene 
25 therapy are provided. 

BACKGROUND OF THE INVENTION 

Retinal dystrophies 

The eye is susceptible to a number of hereditary and/or age related 

degenerative disorders. In the United States, common causes of irreversible 
30 blindness or severe loss of vision are retinal dystrophies (see, e.g., Cotlier et ah 

(1995) Surv. Ophthalmology 40:51-61; Bird (1995) Am. J. OphthaL 7 79: 

543-562; and Adler (1996) Arch Ophthal 7 74:79-83). The retina is the sensory 
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tunic of the eye, containing light sensitive receptors, a complex of neurons, and 
pigmented epithelium, arranged in discrete layers. In humans, the macula is the 
portion of the retina that lies directly behind the lens. Cones, the photoreceptor 
cells responsible for central vision, are heavily concentrated in the macula. 
5 Central dystrophies, which affect the macula, include Best's disease, age-related 
macular degeneration, and Stargardt's macular dystrophy. The peripheral retina 
is composed mainly of rods, which are responsible for side and night vision. 
Peripheral degenerative retinal diseases include retinitis pigmentosa, choroidemia 
and Bietti's crystalline dystrophy. 

10 Macular degenerations are a heterogenous group of diseases, 

characterized by progressive central vision loss and degeneration of the macula 
and underlying retinal pigmented epithelium. Age-related macular degeneration 
(ARMD) is the most common form of the disease, affecting an estimated 20% of 
persons over 75 years of age. ARMD is poorly understood in terms of etiology 

15 and pathogenesis. The very late onset of the disease has made genetic mapping 
particularly difficult. Certain macular degenerative conditions with a clear genetic 
basis, such as Stargardt's and Best's diseases, share many features with ARMD, 
but have been more amenable to molecular and genetic analysis. 

Hereditary peripheral retinopathies are also relatively common. Retinitis 

20 pigmentosa (RP), for example, affects approximately 1 .5 million people 
worldwide. Substantial genetic heterogeneity has been observed in this 
condition, with over 20 chromosomal loci identified. A predisposition to retinitis 
pigmentosa can be inherited by autosomal dominant, autosomal recessive, 
X-linked or digenic mode. Mutations have been identified in seven genes, four of 

25 which encode proteins in the rod phototransduction cascade: rhodopsin, alpha 
and beta subunits of rod cGMP phosphodiesterase, and rod cGMP cation-gated 
channel protein .alpha, subunit. Mutations in the peripherin/RDS gene have been 
linked to retinitis pigmentosa and macular degeneration. A single peripherin/RDS 
mutation apparently caused retinitis pigmentosa, pattern dystrophy and fundus 

30 flavimaculatus, in different family members. 
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In spite of causal heterogeneity, there is significant clinical similarity 
among RP subtypes. Common signs and symptoms include early 
electroretinographic abnormalities, ophthalmoscopic findings, and 
protracted, contiguous expansion of the ring-like scotoma toward the macula, 
5 leading to progressively worsening tunnel vision. A recent hypothesis is that 
active photoreceptor cell death, which is characteristic of these genetically 
distinct disorders, is mediated by a common induction of apoptosis. It may be 
possible to treat these conditions by the administration of agents that block 
induction of apoptosis in photoreceptors, such as neurotrophic factors. 

1 0 Adenovirus delivery vectors 

Adenovirus, which is a DNA virus with a 36 kilobase (kb) genome, is very 
well-characterized and its genetics and genetic organization are understood. 
The genetic organization of adenoviruses permits substitution of large fragments 
of viral DNA with foreign DNA. In addition, recombinant adenoviruses are 

15 structurally stable and no rearranged viruses are observed after extensive 
amplification. 

Adenoviruses have been employed as delivery vehicles for introducing 
desired genes into eukaryotic cells. The adenovirus delivers such genes to 
eukaryotic cells by binding to cellular receptors followed by internalization. The 

20 adenovirus fiber protein is responsible for binding to cells. The fiber protein has 
two domains, a rod-like shaft portion and a globular head portion that contains 
the receptor binding region. The fiber spike is a homotrimer, and there are 12 
spikes per virion. Human adenoviruses bind to and infect a broad range of 
cultured cell lines and primary tissues from different species. 

25 The 35,000 + base pair (bp) genome of adenovirus type 2 has been 

sequenced and the predicted amino acid sequences of the major coat proteins 
(hexon, fiber and penton base) have been described (see, e.g., Neumann et a/., 
Gene 69: 153-157 (1988); Herisse eta/., Nuc. Acids Res. 9: 4023-4041 (1981); 
Roberts et al., J. Biol. Chem. 259: 13968-13975 (1984); Kinloch et al., J. Biol. 

30 Chem. 259: 6431-6436 (1984); and Chroboczek et al., Virol. 161: 549-554, 
1987). 
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The 35,935 bp sequence of Ad 5 DNA is also known and portions of 
many other adenovirus genomes have been sequenced. The upper packaging 
limit for adenovirus virions is about 105% of the wild-type genome length (see, 
e.g., Bett, eta/., J. Virol. 67(1 0): 591 1-21, 1993). Thus, for Ad2 and Ad5, this 
5 would be an upper packaging limit of about 38kb of DNA. 

Adenovirus DNA also includes inverted terminal repeat sequences (ITRs) 
ranging in size from about 100 to 150 bp, depending on the serotype. The 
inverted repeats permit single strands of viral DNA to circularize by base-pairing 
of their terminal sequences to form base-paired "panhandle" structures that are 

10 required for replication of the viral DNA. 

For efficient packaging, the ITRs and the packaging signal (a few hundred bp in 
length) comprise the "minimum requirement" for replication and packaging of a 
genomic nucleic acid into an adenovirus particle. Helper-dependent vectors 
lacking all viral ORFs but including these essential cis elements (the ITRs and 

15 contiguous packaging sequence) have been constructed. 

Ad vectors have several distinct advantages as gene delivery vehicles. 
For example, recombination of such vectors is rare; there are no known 
associations of human malignancies with adenoviral infections despite common 
human infection with adenoviruses; the genome may be manipulated to 

20 accommodate foreign genes of a fairly substantial size; and host proliferation is 
not required for expression of adenoviral proteins. Adenovirus (Ad)-based gene 
delivery vectors efficiently infect many different cells and tissues. This broad 
tropism, however, means that gene delivery cannot be directed to a specific 
target cell. A large fraction of intravenously administered adenovirus is 

25 retained by the liver, which could lead to undesirable side-effects. Adenovirus 
may potentiate immune responses. For example, Adenovirus type 5 (Ad5) also 
transduces dendritic cells, which present antigens very efficiently, thereby 
possibly exacerbating the immune response against the vector. It has been 
proposed that vectors with different targeting efficiencies might eliminate these 

30 problems, permitting a lower total particle dose and more specific targeting (see, 
e.g., U.S. application Serial No. 09/482,682). 
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The wealth of information on adenovirus structure and mechanism of 
infection, its efficient infection of nondividing cells, and its large genetic capacity 
make adenovirus a popular gene therapy vector. The wide expression of 
receptors to which adenovirus binds makes targeting adenovirus vectors 
5 difficult. 

Hence there is a need to improve delivery and targeting of adenoviral 
vectors and also to provide treatments for ocular disorders. Therefore, it is an 
object herein to provide adenoviral vectors that specifically or selectively target 
cells in the eye. It is also an object herein to provide these vectors for treatment 

10 of ocular disorders. 

SUMMARY OF THE INVENTION 

Degenerative ocular diseases, such as, but not limited to, retinitis 
pigmentosa, Stargardt's disease, diabetic retinopathies, retinal vascularization, 
and others (see, e.g., Table below), have a genetic basis. Genes expressed in 

15 the photoreceptor cells at the back of the retina are implicated in these diseases. 
Provided herein are recombinant viral vectors for targeting therapeutic products 
to these cells. 

Recombinant adenoviral vectors that include nucleic acid that permits 
specific binding to these photoreceptors are provided. In particular, the vector 

20 particles contain a fiber protein of Ad37 or a modified form thereof. As shown 
herein, fiber protein from Ad37 permits efficient infection of photoreceptor cells. 
Fiber proteins from other adenovirus D serotypes may also be used. In addition, 
the portions of the fiber protein, particularly those that interact with other viral 
structural proteins, such as penton, may be modified to resemble the viral source 

25 of the other structural proteins. As exemplified herein, the recombinant virus 

provided herein include Ad5 structural components. The N-terminus of the Ad37 
fiber protein, which interacts with the penton protein, is modified to resemble 
the Ad5 fiber protein N-terminus to ensure production of viral particles. 

The recombinant adenoviral vectors are intended for gene therapy of 

30 diseases in which genes expressed in the photoreceptors are implicated. Such 
diseases include, but are not limited to, degenerative ocular diseases, such as 
retinitis pigmentosa and Stargardt's disease. These vectors are also useful for 
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targeting to other ocular cells, such as conjunctival cells, which also bear 
receptors to which fiber from Ad37 and related serotypes bind. 

The vectors will deliver therapeutic agents to the targeted cells for 
treatment of a variety of disorders (see e.g., Tables 3 and 4, below)). The 
5 therapeutic agents are intended for expression in the photoreceptors and for 
secretion from the photorecptor cells, which are surrounded on one side by 
choroidal vasculature, and on the other side by retinal vasulature, thereby 
providing a means for delivery of products. In addition, expression of growth 
factors, such as VEGF and others, can be used to enhance blood flow to the 

10 retina and prevent or slow the degeneration. 

Therapeutic agents encoded by the recombinant adenoviral vectors 
include, but are not limited to, nucleic acid nucleic acid molecules encoding 
genes that are defective in certain hereditary disorders, nucleic acid molecules 
that encode antiangiogenics and antitumor agents for treatment of retinal 

15 disorders, such as retinoblastomas; nucleic acid molecules encoding trophic 
factors, such as glial cell line-derived neuroptrophic factor (GDNF) and ciliary 
neurotrophic factor (CNTF), growth factors and growth factor inhibitors, 
antiapoptotic factors, such as Bcl-2 (CNTF), antitumor agents, anti-angiogenics, 
and genes or portions thereof for gene replacement or repair of defective genes. 

20 Hence, methods for treatment of inherited and acquired retinal diseases, 

including diseases involving neovascular and vascular degeneration are provided. 

Methods for treating diseases involving genes expressed in photoreceptor 
cells are provided herein. The methods provided herein are practiced by 

25 administration of the recombinant viral vectors by any means suitable for 

delivery to the photoreceptors. A preferred mode of administration is intraocular 
injection including intravitreal and subretinal injection. Other modes of 
administration include, but are not limited to, intrascleral, periorbital and 
intravenous administration. The vectors also can include photoreceptor-specific 

30 promoters thereby providing a means, not only for specific targeting of 
expression in these cells, but also for photoreceptor-restricted transgene 
expression. 
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DETAILED DESCRIPTION OF THE INVENTION 
A. DEFINITIONS 

Unless defined otherwise, all technical and scientific terms used herein 
have the same meaning as is commonly understood by one of skill in the art to 
5 which this invention belongs. All patents, applications, published applications 
and other publications and sequences from GenBank and other data bases 
referred to anywhere in the disclosure herein are incorporated by reference in 
their entirety. 

As used herein, the amino acids, which occur in the various amino acid 
10 sequences appearing herein, are identified according to their three-letter or one- 
letter abbreviations. The nucleotides, which occur in the various DNA 
fragments, are designated with the standard single-letter designations used 
routinely in the art (see, Table 1). 

As used herein, amino acid residue refers to an amino acid formed upon 
15 chemical digestion (hydrolysis) of a polypeptide at its peptide linkages. The 
amino acid residues described herein are preferably in the "L" isomeric form. 
However, residues in the "D" isomeric form can be substituted for any L-amino 
acid residue, as long as the desired functional property is retained by the 
polypeptide. NH 2 refers to the free amino group present at the amino terminus 
20 of a polypeptide. COOH refers to the free carboxy group present at the carboxyl 
terminus of a polypeptide. In keeping with standard polypeptide nomenclature 
described in J. Biol. Chem., 243:3552-59 (1969) and adopted at 37 C.F.R. § § 
1 .821 - 1 .822, abbreviations for amino acid residues are shown in the following 
Table: 

25 Table 1 



Table of Correspondence 



SYMBOL 




1 -Letter 


3-Letter 


AMINO ACID 


Y 


Tyr 


tyrosine 


G 


Gly 


glycine 


F 


Phe 


phenylalanine 


M 


Met 


methionine 
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SYMBOL 




A 


Ala 


alanine 


S 


Ser 


serine 


1 


lie 


isoleucine 


L 


Leu 


leucine 


T 


Thr 


threonine 


V 


Val 


valine 


P 


Pro 


proline 


K 


Lys 


lysine 


H 


His 


histidine : 


Q 


Gin 


glutamine 


E 


Glu 


glutamic acid 


Z 


Glx 


Glu and/or Gin 


W 


Trp 


tryptophan 


R 


Arg 


arginine 


D 


Asp 


aspartic acid 


N 


Asn 


asparagine 


B 


Asx 


Asn and/or Asp 


C 


Cys 


cysteine 


X 


Xaa 


Unknown or other 



It should be noted that all amino acid residue sequences represented . 
herein by formulae have a left to right orientation in the conventional direction of 
amino-terminus to carboxyl-terminus. In addition, the phrase "amino acid 
residue" is broadly defined to include the amino acids listed in the Table of 
25 Correspondence and modified and unusual amino acids, such as those referred 
to in 37 C.F.R. § § 1.821-1.822, and incorporated herein by reference. 
Furthermore, it should be noted that a dash at the beginning or end of an amino 
acid residue sequence indicates a peptide bond to a further sequence of one or 
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more amino acid residues or to an amino-terminal group such as NH 2 or to a 
carboxyl-terminal group such as COOH. 

In a peptide or protein, suitable conservative substitutions of amino acids 
are known to those of skill in this art and may be made generally without altering 
5 the biological activity of the resulting molecule. Those of skill in this art 
recognize that, in general, single amino acid substitutions in non-essential 
regions of a polypeptide do not substantially alter biological activity (see, e.g. , 
Watson et aL Molecular Biology of the Gene, 4th Edition, 1 987, The 
Bejacmin/Cummings Pub. co., p. 224). 
10 Such substitutions are preferably made in accordance with those set forth 

in TABLE 2 as follows: 

TABLE 2 





Original residue 


Conservative substitution 




Ala (A) 


Gly; Ser 


15 


Arg (R) 


Lys 




Asn (N) 


Gin; His 




Cys (C) 


Ser 




Gin (Q) 


Asn 




Glu (E) 


Asp 


20 


Gly (G) 


Ala; Pro 




His (H) 


Asn; Gin 




He (I) 


Leu; Val 




Leu (L) 


lie; Val 




Lys (K) 


Arg; Gin; Glu 


25 


Met (M) 


Leu; Tyr; lie 




Phe (F) 


Met; Leu; Tyr 




Ser (S) 


Thr 




Thr (T) 


Ser 




Trp (W) 


Tyr 


30 


Tyr (Y) 


Trp; Phe 




Val (V) 


lie; Leu 



Other substitutions are also permissible and may be determined empirically or in 
accord with known conservative substitutions. 
35 As used herein, a complementing plasmid describes plasmid vectors that 

deliver nucleic acids into a packaging cell line for stable integration into a 
chromosome in the cellular genome. 

As used herein, a delivery plasmid is a plasmid vector that carries or 
delivers nucleic acids encoding a therapeutic gene or gene that encodes a 
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therapeutic product or a precursor thereof or a regulatory gene or other factor 
that results in a therapeutic effect when delivered in vivo in or into a cell line, 
such as, but not limited to a packaging cell line, to propagate therapeutic viral 
vectors. 

5 As used herein, a variety of vectors with different requirements are 

described. For example, one vector is used to deliver particular nucleic acid 
molecules into a packaging cell line for stable integration into a chromosome. 
These types of vectors are generally identified herein as complementing 
plasmids. A further type of vector described herein carries or delivers nucleic 

10 acid molecules in or into a cell line (e.g., a packaging cell line) for the purpose of 
propagating therapeutic viral vectors; hence, these vectors are generally referred 
to herein as delivery plasmids. A third "type" of vector described herein is used 
to carry nucleic acid molecules encoding therapeutic proteins or polypeptides or 
regulatory proteins or are regulatory sequences to specific cells or cell types in a 

15 subject in need of treatment; these vectors are generally identified herein as 
therapeutic viral vectors or recombinant adenoviral vectors or viral Ad-derived , 
vectors and are in the form of a virus particle encapsulating a viral nucleic acid 
containing an expression cassette for expressing the therapeutic gene. 

As used herein, a DNA or nucleic acid homolog refers to a nucleic acid 

20 that includes a preselected conserved nucleotide sequence, such as a sequence 
encoding a therapeutic polypeptide. By the term "substantially homologous" is 
meant having at least 80%, preferably at least 90%, most preferably at least 
95% homology therewith or a lesser percentage of homology or identity and 
conserved biological activity or function. 

25 The terms "homology" and "identity" are often used interchangeably. In 

this regard, percent homology or identity may be determined, for example, by 
comparing sequence information using a GAP computer program. The GAP 
program utilizes the alignment method of Needleman and Wunsch (J. Mo/. Biol. 
48:443 (1970), as revised by Smith and Waterman (Adv. Appl. Math. 2:482 

30 (1981), Briefly, the GAP program defines similarity as the number of aligned 

symbols (i.e., nucleotides or amino acids) which are similar, divided by the total 
number of symbols in the shorter of the two sequences. The preferred default 
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parameters for the GAP program may include: (1) a unary comparison matrix 
(containing a value of 1 for identities and 0 for non-identities) and the weighted 
comparison matrix of Gribskov and Burgess, Nucl. Acids Res. 14:6745 (1986), 
as described by Schwartz and Dayhoff, eds., ATLAS OF PROTEIN SEQUENCE 
5 AND STRUCTURE, National Biomedical Research Foundation, pp. 353-358 

(1979); (2) a penalty of 3.0 for each gap and an additional 0.10 penalty for each 
symbol in each gap; and (3) no penalty for 
end gaps. 

Whether any two nucleic acid molecules have nucleotide sequences that 

10 are at least 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% "identical" can be 
determined using known computer algorithms such as the "FAST A" program, 
using for example, the default parameters as in Pearson and Lipman, Proc. Natl. 
Acad. ScL USA 55:2444 (1988). Alternatively the BLAST function of the 
National Center for Biotechnology Information database may be used to 

15 determine identity. 

In general, sequences are aligned so that the highest order match is 
obtained. "Identity" per se has an art-recognized meaning and can be calculated 
using published techniques. (See, e.g.: Computational Molecular Biology, Lesk, 
A.M., ed., Oxford University Press, New York, 1988; Biocomputing: Informatics 

20 and Genome Projects, Smith, D.W., ed., Academic Press, New York, 1993; 
Computer Analysis of Sequence Data r Part I f Griffin, A.M., and Griffin, H.G., 
eds., Humana Press, New Jersey, 1 994; Sequence Analysis in Molecular 
Biology, von Heinje, G., Academic Press, 1987; and Sequence Analysis Primer, 
Gribskov, M. and Devereux, J., eds., M Stockton Press, New York, 1991). 

25 While there exist a number of methods to measure identity between two 

polynucleotides or polypeptide sequences, the term "identity" is well known to 
skilled artisans (Carillo, H. & Upton, D., S/AM J Applied Math 48:1073 (1988)). 
Methods commonly employed to determine identity or similarity between two 
sequences include, but are not limited to, those disclosed in Guide to Huge 

30 Computers, Martin J. Bishop, ed., Academic Press, San Diego, 1994, and 
Carillo, H. & Lipton, D., S/AM J Applied Math 45:1073 (1988). Methods to 
determine identity and similarity are codified in computer programs. Preferred 
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computer program methods to determine identity and similarity between two 
sequences include, but are not limited to, GCG program package (Devereux, J., 
et aL, Nucleic Acids Research 72(0:387 (1984)), BLASTP, BLASTN, FASTA 
(Atschul, S.F., et aL, J Molec Biol 275:403 (1 990)). 
5 Therefore, as used herein, the term "identity" represents a comparison 

between a test and a reference polypeptide or polynucleotide. For example, a 
test polypeptide may be defined as any polypeptide that is 90% or more 
identical to a reference polypeptide. As used herein, the term at least "90% 
identical to" refers to percent identities from 90 to 99.99 relative to the 

10 reference polypeptides. Identity at a level of 90% or more, is indicative of the 
fact that, assuming for exemplification purposes a test and reference 
polynucleotide length of 100 amino acids are compared. No more than 10% 
(i.e., 10 out of 100) amino acids in the test polypeptide differs from that of the 
reference polypeptides. Similar comparisons may be made between a test and 

15 reference polynucleotides. Such differences may be represented as point 

mutations randomly distributed over the entire length of an amino acid sequence 
or they may be clustered in one or more locations of varying length up to the 
maximum allowable, e.g. 10/100 amino acid difference (approximately 90% 
identity). Differences are defined as nucleic acid or amino acid substitutions, or 

20 deletions. 

As used herein, genetic therapy involves the transfer of heterologous 
DNA to the certain cells, target cells, of a mammal, particularly a human, with a 
disorder or conditions for which such therapy is sought. The DNA is introduced 
into the selected target cells in a manner such that the heterologous DNA is 

25 expressed and a therapeutic product encoded thereby is produced. 

Alternatively, the heterologous DNA may in some manner mediate expression of 
DNA that encodes the therapeutic product, it may encode a product, such as a 
peptide or RNA that in some manner mediates, directly or indirectly, expression 
of a therapeutic product. Genetic therapy may also be used to deliver nucleic 

30 acid encoding a gene product to replace a defective gene or supplement a gene 
product produced by the mammal or the cell in which it is introduced. The 
introduced nucleic acid may encode a therapeutic compound, such as a growth 
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factor inhibitor thereof, or a tumor necrosis factor or inhibitor thereof, such as a 
receptor therefor, that is not normafly produced in the mammalian host or that is 
not produced in therapeutically effective amounts or at a therapeutically useful 
time. The heterologous DNA encoding the therapeutic product may be modified 
5 prior to introduction into the cells of the afflicted host in order to enhance or 
otherwise alter the product or expression thereof. 

As used herein, heterologous DNA is DNA that encodes RNA and proteins 
that are not normally produced in vivo by the cell in which it is expressed or that 
mediates or encodes mediators that alter expression of endogenous DNA by 

10 affecting transcription, translation, or other regulatable biochemical processes. 
Heterologous DNA may also be referred to as foreign DNA. Any DNA that one 
of skill in the art would recognize or consider as heterologous or foreign to the 
cell in which it is expressed is herein encompassed by heterologous DNA. 
Examples of heterologous DNA include, but are not limited to, DNA that encodes 

1 5 traceable marker proteins, such as a protein that confers drug resistance, DNA 
that encodes therapeutically effective substances, such as anti-cancer agents, 
enzymes and hormones, and DNA that encodes other types of proteins, such as 
antibodies. Antibodies that are encoded by heterologous DNA may be secreted 
or expressed on the surface of the cell in which the heterologous DNA has been 

20 introduced. 

Hence, herein heterologous DNA or foreign DNA, refers to a DNA 
molecule not present in the exact orientation and position as the counterpart 
DNA molecule found in the corresponding wild-type adenovirus. It may also 
refer to a DNA molecule from another organism or species [i.e., exogenous) or 

25 from another Ad serotype. 

As used herein, a therapeutically effective product is a product that is 
encoded by heterologous DNA that, upon introduction of the DNA into a host, a 
product is expressed that effectively ameliorates or eliminates the symptoms, 
manifestations of an inherited or acquired disease or that cures said disease. 

30 Typically, DNA encoding the desired heterologous DNA is cloned into a 

plasmid vector and introduced by routine methods, such as calcium-phosphate 
mediated DNA uptake {see, (1981) Somat. Cell. Mol. Genet. 7:603-616) or 
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microinjection, into producer cells, such as packaging cells. After amplification 
in producer cells, the vectors that contain the heterologous DNA are introduced 
into selected target cells. 

As used herein, an expression or delivery vector refers to any plasmid or 
5 virus into which a foreign or heterologous DNA may be inserted for expression in 
a suitable host cell — i.e., the protein or polypeptide encoded by the DNA is 
synthesized in the host cell's system. Vectors capable of directing the 
expression of DNA segments (genes) encoding one or more proteins are referred 
to herein as "expression vectors." Also included are vectors that allow cloning 
10 of cDNA (complementary DNA) from mRNAs produced using reverse 
transcriptase. 

As used herein, a gene is a nucleic acid molecule whose nucleotide 
sequence encodes RNA or polypeptide. A gene can be either RNA or DNA. 
Genes may include regions preceding and following the coding region (leader and 

15 trailer) as well as intervening sequences (introns) between individual coding 
segments (exons). 

As used herein, tropism with reference to an adenovirus refers to the 
selective infectivity or binding that is conferred on the particle by the fiber 
protein, such as by the C-terminus portion that comprises the knob. 

20 As used herein, isolated with reference to a nucleic acid molecule or 

polypeptide or other biomolecule means that the nucleic acid or polypeptide has 
separated from the genetic environment from which the polypeptide or nucleic 
acid were obtained. It may also mean altered from the natural state. For 
example, a polynucleotide or a polypeptide naturally present in a living animal is 

25 not "isolated," but the same polynucleotide or polypeptide separated from the 
coexisting materials of its natural state is "isolated", as the term is employed 
herein. Thus, a polypeptide or polynucleotide produced and/or contained within 
a recombinant host cell is considered isolated. Also intended as an "isolated 
polypeptide" or an "isolated polynucleotide" are polypeptides or polynucleotides 

30 that have been purified, partially or substantially, from a recombinant host cell or 
from a native source. For example, a recombinantly produced version of a 
compound can be substantially purified by the one-step method described in 
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Srnith and Johnson, Gene 57/31-40 (1988). The terms isolated and purified are 
sometimes used interchangeably. 

Thus, by "isolated" is meant that the nucleic acid is free of the coding 
sequences of those genes that, in the naturally-occurring genome of the 
5 organism (if any) immediately flank the gene encoding the nucleic acid of 

interest. Isolated DNA may be single-stranded or double-stranded, and may be 
genomic DNA, cDNA, recombinant hybrid DNA, or synthetic DNA. It may be 
identical to a native DNA sequence, or may differ from such sequence by the 
deletion, addition, or substitution of one or more nucleotides. 

10 Isolated or purified as it refers to preparations made from biological cells 

or hosts means any cell extract containing the indicated DNA or protein including 
a crude extract of the DNA or protein of interest. For example, in the case of a 
protein, a purified preparation can be obtained following an individual technique 
or a series of preparative or biochemical techniques and the DNA or protein of 

15 interest can be present at various degrees of purity in these preparations. The 
procedures may include for example, but are not limited to, ammonium sulfate 
fractionation, gel filtration, ion exchange chromatography, affinity 
chromatography, density gradient centrifugation and electrophoresis. 

A preparation of DNA or protein that is "substantially pure" or "isolated" 

20 should be understood to mean a preparation free from naturally occurring 
materials with which such DNA or protein is normally associated in nature. 
"Essentially pure" should be understood to mean a "highly" purified preparation 
that contains at least 95% of the DNA or protein of interest. 

A cell extract that contains the DNA or protein of interest should be 

25 understood to mean a homogenate preparation or cell-free preparation obtained 
from cells that express the protein or contain the DNA of interest. The term 
"cell extract" is intended to include culture media, especially spent culture media 
from which the cells have been removed. 

As used herein, a packaging cell line is a cell line that provides a missing 

30 gene product or its equivalent. 

As used herein, an adenovirus viral particle is the minimal structural or 
functional unit of a virus. A virus can refer to a single particle, a stock of 
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particles or a viral genome. The adenovirus (Ad) particle is relatively complex 
and may be resolved into various substructures. 

As used herein, "penton" or "penton complex" are preferentially used 
herein to designate a complex of penton base and fiber. The term "penton" may 
5 also be used to indicate penton base, as well as penton complex. The meaning 
of the term "penton" alone should be clear from the context within which it is 
used. 

As used herein, a plasmid refers to an autonomous self-replicating 
extrachromosomal circular nucleic acid molecule, typically DNA. 

10 As used herein, a post-transcription regulatory element (PRE) is a 

regulatory element found in viral or cellular messenger RNA that is not spliced, 
i.e. intronless messages. Examples include, but are not limited to, human 
hepatitis virus, woodchuck hepatitis virus, the TK gene and mouse histone gene. 
The PRE may be placed before a polyA sequence and after a heterologous DNA 

15 sequence. 

As used herein, pseudotyping describes the production of adenoviral 
vectors having modified capsid protein or capsid proteins from a different 
serotype than the serotype of the vector itself. One example, is the production 
of an adenovirus 5 vector particle containing an Ad37 fiber protein. This may be 

20 accomplished by producing the adenoviral vector in packaging cell lines 
expressing different fiber proteins. 

As used herein, promoters of interest herein may be inducible or 
constitutive. Inducible promoters will initiate transcription only in the presence 
of an additional molecule; constitutive promoters do not require the presence of 

25 any additional molecule to regulate gene expression, a regulatable or inducible 
promoter may also be described as a promoter where the rate or extent of RNA 
polymerase binding and initiation is modulated by external stimuli. Such stimuli 
include, but are not limited to various compounds or compositions, light, heat, 
stress and chemical energy sources. Inducible, suppressible and repressible 

30 promoters are considered regulatable promoters. Preferred promoters herein, are 
promoters that are selectively expressed in ocular cells, particularly 
photoreceptor cells. 
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As used herein, receptor refers to a biologically active molecule that 
specifically or selectively binds to (or with) other molecules. The term "receptor 
protein" may be used to more specifically indicate the proteinaceous nature of a 
specific receptor. 

5 As used herein, recombinant refers to any progeny formed as the result of 

genetic engineering. This may also be used to describe a virus formed by 
recombination of plasmids in a packaging cell. 

As used herein, a transgene or therapeutic nucleic acid molecule includes 
DNA and RNA molecules encoding an RNA or polypeptide. Such molecules may 

10 be "native " or naturally-derived sequences; they may also be "non-native" or 
"foreign" that are naturally- or recombinantly-derived. The term "transgene," 
which may be used interchangeably herein with the term "therapeutic nucleic 
acid molecule," is often used to describe a heterologous or foreign (exogenous) 
gene that is carried by a viral vector and transduced into a host cell. 

15 Therefore, therapeutic nucleotide nucleic acid molecules include antisense 

sequences or nucleotide sequences which may be transcribed into antisense 
sequences. Therapeutic nucleotide sequences (or transgenes) all include nucleic 
acid molecules that function to produce a desired effect in the cell or cell nucleus 
into which said therapeutic sequences are delivered. For example, a therapeutic 

20 nucleic acid molecule can include a sequence of nucleotides that encodes a 
functional protein intended for delivery into a cell which is unable to produce 
that functional protein. 

As used herein, the vitreous of the eye refers to material that fills the 
chamber behind the lens of the eye (i.e., vitreous humor or vitreous body). 

25 As used herein, a promoter region refers to the portion of DNA of a gene 

that controls transcription of the DNA to which it is operatively linked. The 
promoter region includes specific sequences of DNA that are sufficient for RNA 
polymerase recognition, binding and transcription initiation. This portion of the 
promoter region is referred to as the promoter. In addition, the promoter region 

30 includes sequences that modulate this recognition, binding and transcription 

initiation activity of the RNA polymerase. These sequences may be cis acting or 
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may be responsive to trans acting factors. Promoters, depending upon the 
nature of the regulation, may be constitutive or regulated. 

Thus, promoters are nucleic acid fragments that contain a DNA sequence 
that controls the expression of a gene located 3' or downstream of the 
5 promoter. The promoter is the DNA sequence to which RNA polymerase 
specifically binds and initiates RNA synthesis (transcription) of that gene, 
typically located 3' of the promoter. A promoter also includes DNA sequences 
that direct the initiation of transcription, including those to which RNA 
polymerase specifically binds. If more than one nucleic acid sequence encoding a 

10 particular polypeptide or protein is included in a therapeutic viral vector or 
nucleotide sequence, more than one promoter or enhancer element may be 
included, particularly if that would enhance efficiency of expression. 

A regulatable or inducible promoter may be described as a promoter 
wherein the rate of RNA polymerase binding and initiation is modulated by 

15 external stimuli, (see, e.g., U.S. Patent Nos. 5,750,396 and 5,998,205). Such 
stimuli include various compounds or compositions, light, heat, stress, chemical 
energy sources, and the like. Inducible, suppressible and repressible promoters 
are considered regulatable promoters. 

Regulatable promoters may also include tissue-specific promoters. 

20 Tissue-specific promoters direct the expression of the gene to which they are 

operably linked to a specific cell type. Tissue-specific promoters cause the gene 
located 3' of it to be expressed predominantly, if not exclusively, in the specific 
cells where the promoter expressed its endogenous gene. Typically, it appears 
that if a tissue-specific promoter expresses the gene located 3' of it at all, then 

25 it is expressed appropriately in the correct cell types (see, e.g., Palmiter et al. 
(1986) Ann. Rev. Genet. 20: 465-499). 

As used herein, the phrase "operatively linked" generally means the 
sequences or segments have been covalently joined into one piece of DNA, 
whether in single or double stranded form, whereby control sequences on one 

30 segment control expression or replication or other such control of other 
segments. The two segments are not necessarily contiguous. 
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As used herein, exogenous encompasses any therapeutic composition 
that is administered by the therapeutic methods provided herein. Thus, 
exogenous may also be referred to herein as foreign, or non-native or other 
equivalent expression. 
5 B. Ad37 fiber tropism 

The adenovirus fiber protein is a major determinant of adenovirus tropism 
{GaUetaf. (1996) J. ViroL 70:2116-2123; Stevenson eta/. (1 995) J. Virol. 
63:2850-2857). The fiber protein extends from the capsid and mediates viral 
binding to the cell surface by binding to specific cell receptors (Philipson et a/. 

10 (1968) J. Virol. 2:1064-1075). The fiber is a trimeric protein that includes an N- 
terminal tail domain that interacts with the adenovirus penton base, a central 
shaft domain of varying length, and a C-terminal knob domain that contains the 
cell receptor binding site (Chroboczek et aL (1995) Curr.Top. Microbiol. Immunol. 
733:163-200; Riurok et al. (1990) J.Mol.Biol. 275:589-596; Stevenson et al. 

15 (1995) J. ViroL 63:2850-2857). Fiber proteins of most adenovirus subgroups 

have been shown to bind specifically or selectively to the 46 kDa coxsackievirus- 
adenovirus receptor (CAR), (Bergelson et aL (1997) Science 275:1320-1323; 
Roelvink et aL (1998) J. Virol. 72:7909-7915). CAR appears to be expressed in 
a variety of human tissues, including the lung, at various levels (Bergelson et aL 

20 (1997) Science 275:1320-1323), but Ad37 binds poorly to lung epithelial cells 
(Huang et aL (1999) J. ViroL 75:2798-2802). This suggests that the tropism of 
this serotype may be influenced by factors independent of CAR expression. 

Structural and biochemical data also suggest that distinct receptor binding 
sites are located on different regions of the Ad5 and Ad37 fiber knobs. 

25 Adopting the nomenclature of Xia et al. (Xia et al. (1994) Structure 2:1259- 

1 270), the receptor binding site for Ad5 is located at the AB-loop on the side of 
the fiber knob (Bewley et aL (1999) Science 256:1579-1583; Roelvink et aL 
(1999) Science 286'A 568-1571). It is known that a lysine residue at position 
240 of the Ad37 fiber, located in the CD-loop, is important for receptor binding 

30 (Huang et aL (1999) J. ViroL 75:2798-2802). The co-crystal structure of the 
Ad 12 knob and the N-terminal domain of CAR (Bewley et aL (1999) Science 
256:1579-1583) show that the CD-loop does not contact CAR- It thus appears 
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that different regions of the Ad5 and Ad37 fiber knobs recognize distinct cell 
receptors. 

A 46 kDa receptor for coxsackieviruses and adenoviruses (CAR) mediates 
attachment for many adenovirus serotypes. The wide distribution of CAR fails 
5 to explain why certain adenovirus serotypes (i.e. Ad37) are highly associated 
with severe ocular infections such as epidemic keratoconjunctivitis (EKC). Ad37 
does not use CAR, but instead uses a glycoprotein that contains sialic acid as its 
primary receptor (Arnberg et aL ((2000) J. Virol. 74:42-48). The modest 
number of Ad37 binding sites per cell (Huang et al. (1999) J. Virol. 73:2798- 

10 2802) also suggests that Ad37 recognizes a specific glycoprotein as its primary 
receptor for binding to conjunctival cells. 

Adenovirus type 37 (subgroup D) has been associated with infections of 
the eye and genital tract. The tropism of Ad37 derives from the binding 
preference of its fiber protein, which binds to a receptor located on the surface 

15 of cells including Chang C, conjunctival epithelial cell line (Huang et aL (1999) J. 
Virology 73:2798-2802). 

A protein receptor that is preferentially expressed on conjunctival cells to 
which Ad37 fiber binds is shown herein. The preferential expression of the 
Ad37 receptor protein on conjunctival cells suggests that this receptor likely 

20 influences Ad37 tropism and should play a key role in ocular pathogenesis. It is 
shown herein that Ad37 uses a distinct protein receptor that is selectively 
expressed on conjunctival cells. It is shown that Ad37 binds well to conjunctival 
cells (Chang C), but poorly to lung carcinoma cells (A549). To determine if 
infection correlated with cell binding, an Ad5 vector containing the Ad37 fiber 

25 protein was constructed. The 'pseudotyped' vector delivered transgenes to 
Chang C cells better than to A549 cells. Ad37 binding was abolished by 
protease treatment of Chang C cells, indicating the receptor is a membrane 
protein. Ad37 binding to conjunctival cells is shown herein to be calcium- 
dependent. It is also shown that Ad37 infection was not inhibited by a function- 

30 blocking anti-CAR monoclonal antibody, which is a feature distinct from Ad5 
fiber interaction with CAR. Using a virus overlay protein blot assay (VOPBA), 
calcium-dependent Ad37 binding to a 50 KDa membrane protein on Chang C 
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cells, but not A549 cells was detected. Ad19p a closely related serotype that 
fails to bind to conjunctival cells, does not recognize the 50 kDa protein. 
Together, these data indicate that the 50 kDa protein is a candidate receptor for 
Ad37 on conjunctival cells. 
5 Significantly, it is also shown herein that, upon administration of the 

vector to the vitreous humor, the recombinant adenovirus with the Ad37 fiber 
preferentially and selectively binds to photoreceptor cells. Hence, a recombinant 
adenoviral delivery vehicle that has an Ad37 fiber protein can serve as a vector 
for delivery of therapeutic agents to the eye for treatment of ocular disorders, 

10 including genetic and acquired disorders. The identification of the receptor for 
Ad37 and the resulting recognition of Ad37 tropism allows targeting of 
adenovirus vectors to specific human ocular cells. 

As noted, fiber plays a crucial role in adenovirus infection by attaching 
the virus to a specific receptor on a cell surface. Hexon, penton and fiber 

15 capsomeres are the major components on the surface of the virion. The fiber is 
an elongated protein which exists as a trimer of three identical polypeptides 
(polypeptide IV) of 582 amino acids in length. An adenovirus fiber includes 
three domains: an N-terminal tail domain that interacts with penton base; a 
shaft composed of variable numbers of repeats of a 1 5-amino-acid segment that 

20 forms beta-sheet and beta-bends; and a knob at the C-terminus ("head domain") 
that contains the type-specific antigen and is responsible for binding to the cell 
surface receptor. The gene encoding the fiber protein from Ad2 has been 
expressed in human cells and has been shown to be correctly assembled into 
trimers, glycosylated and transported to the nucleus (see, e.g., Hong and Engler, 

25 Virology 785: 758-761, 1991). Thus, alteration of the fiber in recombinant Ad 
vectors can lead to alteration in gene delivery. 

As shown herein, alteration of fiber in recombinant Ad vectors such that 
the fiber is derived from Ad37 or another adenovirus serotype D, provides a 
means for selective delivery of a recombinant virus to particular cells in the eye, 

30 including conjunctival cells, and most significantly photoreceptors, thereby 
providing a means for targeted delivery to photoreceptor cells. 
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Photoreceptor cells are implicated in a number of hereditary and acquired 
retinal degenerative disorders. In addition, photoreceptor cells are located such 
that products produced therein can be delivered to other areas of the eye by 
virtue of the blood flow in the vicinity of the photoreceptor cells and also by 
5 virtue of the proximity of the photoreceptors to the retinal pigmented epithelium 
(RPE) and other retinal cells. 

Hence it is contemplated herein that the recombinant viral vector will 
include a packaged recombinant adenovirus genome containing at least the 
minimal elements for replication and packaging; heterologous DNA encoding a 

10 desired gene product, typically a therapeutic product or plurality of products, 

such as several trophic factors, whose combined activity is effective for treating 
a disorder, such as a retinal degenerative disorder; and the resulting virion 
particles will include a fiber that has a sufficient portion to confer specific 
targeting to photoreceptor cells when the recombinant viral particles are 

15 introduced into the aqueous humor of a mammalian, preferably a human, eye, or 
otherwise contacted with the photoreceptor cells. The fiber may be a chimeric 
protein that has been modified for effective interaction with other coat structural 
proteins, such as penton. In addition, the fiber may be modified to include other 
elements that alter its tropism to permit binding to other cells as well (see, e.g., 

20 U.S. Patent Nos. 5,756,086 and 5,543,328, International PCT application No. 
WO 95/26412 and WO 98/44121 and Krasnykh, et al. U Virol. 70: 6839-46, 
1996). 

C. Construction of the viral particles 

1 . Selection of viral genome and fiber protein 

25 Methods for preparing recombinant adenoviral vectors for gene 

product delivery are well known. Preferred among those are the methods 
exemplified herein (see EXAMPLES) and also described in copending U.S. 
application Serial No. 09/482,682 (also filed as International PCT application 
No. PCT/USOO/00265, filed January 14, 2000, which claims priority to U.S. 

30 provisional application Serial No. 60/1 15,920, as does U.S. application Serial 
No. 09/482,682)). 
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As noted, any desired recombinant adenovirus is contemplated for use in 
the methods herein as long as the viral genome is packaged in a capsid that 
includes at least the portion of a fiber protein that provides selective binding to 
photoreceptor cells. This fiber protein is preferably from an adenovirus type D 
5 serotype and is preferably an Ad37 fiber. The fiber protein should retain the 
knob region at the C-terminus ("head domain") from the Ad virus of subgroup D 
that contains the type-specific antigen and is responsible for binding to the cell 
surface receptor. Hence the fiber protein can be a chimeric fiber protein as long 
as it retains a sufficient portion of the type D serotype to specifically or 

10 selectively bind to photoreceptor cells. Generally the portion retained will be all 
or a portion of the knob region. The precise amount of knob region required can 
be determined empirically by including portions thereof and identifying the 
minimum residues from and Ad type D serotype, preferably Ad37, to effect 
selective targeting of a virion packaged with such fiber to photoreceptors in the 

15 eye upon introduction of the packaged virion into the aqueous humor. 

Recombinant adenovirus containing heterologous nucleic acids that 
encode a desired product, such a gene to correct a genetic defect, may be made 
by any methods known to those of skill in the art. The viruses must be 
packaged in a cell line that results in expression of fiber on the particles that 

20 specifically, electively or preferentially targets (binds and results in 

internalization) the viral particle to cells in the eye. The fiber protein from Ad37 
and other Adenoviruses of serotype D that infect the eye effects such targeting. 
The resulting adenovirus particles that express such fiber is administered by 
intraocular injection, subretinal injection, particularly intravitreal injection, or any 

25 means that results in preferential accumulation in photoreceptor cells. 

The family of Adenoviridae includes many members with at least 47 
known serotypes of human adenovirus (Ad1-Ad47) (Shenk, Virology, Chapter 
67, in Fields et al., eds. Lippincott-Raven, Philadelphia, 1996,) as well as 
members of the genus Mastadenovirus including human, simian, bovine, equine, 

30 porcine, ovine, canine and opossum viruses and members of the Aviadenovirus 
genus, including bird viruses, such as CELO. 
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Thus it is contemplated that the methods herein can be applied to any 
recombinant viral vectors derived from any adenovirus species. One of skill in 
the art would have knowledge of the different adenoviruses (see, e.0r.,Shenk, 
Virology, Chapter 67, in Fields et aL, eds. Lippincott-Raven, Philadelphia, 1996,) 
5 and can construct recombinant viruses containing portions of the genome of any 
such virus. 

In the exemplified embodiment, viral particles with Ad37 fiber were 
prepared. Site-directed mutations were made to the Ad37 fiber gene to make 
the tail sequence more closely match that of Ad5 to facilitate Ad37 fiber binding 

10 to the Ad5 penton base. The plasmid for the expression of the Ad37 fiber 

protein, pDV80, contains the CMV promoter, the adenovirus type 5 tripartite 
leader (TPL), and the modified Ad37 fiber gene sequence. Genes of interest, 
such as nucleic acid encoding the fi subunit of cGMP phosphodiesterase (yffPDE), 
/^glucuronidase, rhodopsin, growth factors, anti-cancer agents, growth factor 

15 receptors and other anti-angiogenic agents, and anti-apoptotic agents, can be 
incorporated into these vectors using the methods known to those of skill in the 
art and exemplified herein. 

Known adenovirus vectors, previously constructed for intraocular therapy 
(see, e.g., Bennett et aL (1996) Nature Medicine 2:649-654, which provides an 

20 Ad virus encoding /?PDE for treatment of retinitis pigmentosa; Cayouette et al. 
(1998) Human Gene Therapy 5:423-430, which provides an Ad vector that 
expresses CNTF for treatment of retinitis pigmentosa and other retinal 
degenerative diseases; and Li et aL (1995) Proc. Natl. Acad. Sci. U.S.A. 
52:7700-7704, which provides an Ad virus vector that encodes a human 0- 

25 glucuronidase for treatment of lysosomal storage disease caused by /?- 

glucuronidase deficiency) can be modified by repackaging the recombinant 
genome using a packaging line that expresses an Ad37 fiber or other D serotype 
fiber. 

For exemplification, nucleic acid encoding GFP was incorporated into 
30 these vectors as a means to visualize their localization. Other genes, such as 

genes that encode therapeutic products, my be included in place of or in addition 
to GFP. 
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Plasmid pDV80 was electroporated into E1-2a S8 cells and stable lines 
were selected. The fiber-deleted vectors Ad5.yffgal.AF and Ad5.GFP.AF were 
grown in cells in a resulting cell line, designated 705, to produce virions, which 
express the Ad37 fiber (Ad5.jffgal.AF/37F and Ad5.GFP.AF/37F) and CsCI- 
5 purified. These virions selectively transduce photoreceptor cells when injected 
intraocularly into the vitreous humor. 

2. Packaging 

Recombinant adenoviral vectors generally have at least a deletion in the 
first viral early gene region, referred to as E1, which includes the E1a and E1b 

10 regions. Deletion of the viral E1 region renders the recombinant adenovirus 

defective for replication and incapable of producing infectious viral particles in 
subsequently-infected target cells. Thus, to generate E1 -deleted adenovirus 
genome replication and to produce virus particles requires a system of 
complementation which provides the missing E1 gene product. E1 

15 complementation is typically provided by a cell line expressing E1, such as the 
human embryonic kidney packaging cell line, i.e. an epithelial cell line, called 
293. Cell line 293 contains the E1 region of adenovirus, which provides E1 
gene region products to "support" the growth of E1 -deleted virus in the cell line 
(see, e.g., Graham et aL, J. Gen. Virol. 36: 59-71, 1977). Additionally, cell lines 

20 that may be usable for production of defective adenovirus having a portion of 
the adenovirus E4 region have been reported (WO 96/22378). 

Multiply deficient adenoviral vectors and complementing cell lines have 
also been described (WO 95/34671, U.S. Patent No. 5,994,106). 

Copending U.S. application Serial No. 09/482,682 (also filed as 

25 International PCT application No. PCT/USOO/00265, filed January 14, 2000)) 
provides packaging cell lines that support viral vectors with deletions of major 
portions of the viral genome, without the need for helper viruses and also 
provides cell lines and helper viruses for use with helper-dependent vectors. 
The packaging cell line has heterologous DNA stably integrated into the 

30 chromosomes of the cellular genome. The heterologous DNA sequence encodes 
one or more adenovirus regulatory and/or structural polypeptides that 
complement the genes deleted or mutated in the adenovirus vector genome to 



WO 01/83729 



PCT/EP01/04863 



-26- 

be replicated and packaged. The packaging cell line express, for example, one 
or more adenovirus structural proteins, polypeptides, or fragments thereof, such 
as penton base, hexon, fiber, polypeptide Ilia, polypeptide V, polypeptide VI, 
polypeptide VII, polypeptide VIII, and biologically active fragments thereof. The 
5 expression can be constitutive or under the control of a regulatable promoter. 
These cell lines are designed for expression of recombinant adenoviruses 
intended for delivery of therapeutic products. 

Particular packaging cell lines complement viral vectors having a deletion 
or mutation of a DNA sequence encoding an adenovirus structural protein, 
10 regulatory polypeptides E1A and E1B, and/or one or more of the following 

regulatory proteins or polypeptides: E2A, E2B, E3, E4, L4, or fragments thereof. 

The packaging cell lines are produced by introducing each DNA molecule 
into the cells and then into the genome via a separate complementing plasmid or 
plurality of DNA molecules encoding the complementing proteins can be 
15 introduced via a single complementing plasmid. Of interest herein, is a variation 
in which the complementing plasmid includes DNA encoding adenovirus fiber 
protein (or a chimeric or modified variant thereof), from Ad virus of subgroup D, 
such as Ad 37, polypeptide or fragment thereof. 

For therapeutic applications, the delivery plasmid further includes a 
20 nucleotide sequence encoding a foreign polypeptide. Exemplary delivery 

plasmids include, but are not limited to, pDV44, pAE1B>ff-gal and pAE1sp1B. In 
a similar or analogous manner, therapeutic genes may be introduced. 

The cell further includes a complementing plasmid encoding a fiber as 
contemplated herein; the plasmid or portion thereof is integrated into a 
25 chromosome(s) of the cellular genome of the cell. 

In one embodiment, a composition comprises a cell containing first and 
second delivery plasmids wherein a first delivery plasmid comprises an 
adenovirus genome lacking a nucleotide sequence encoding fiber and incapable 
of directing the packaging of new viral particles in the absence of a second 
30 delivery plasmid, and a second delivery plasmid comprises an adenoviral genome 
capable of directing the packaging of new viral particles in the presence of the 
first delivery plasmid. 
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In a variation, the packaging cell line expresses fiber protein or chimeric 
variant thereof from an Ad virus of subgroup D, preferably Ad37, serotype or it 
can be any fiber protein but one that has been modified to include the portion of 
the Ad virus of subgroup D, such as Ad37, responsible for selective targeting to 
5 photoreceptors upon introduction into the vitreous humor of the eye of a 
mammal, preferably a human. The fiber protein can be further modified to 
include a non-native amino acid residue sequence that targets additional specific 
receptors. In all instances, the modification should not disrupt trimer formation 
or transport of fiber into the nucleus. In another variation, the non-native amino 

10 acid residue sequence alters the binding specificity of the fiber for a targeted cell 
type. The structural protein is fiber can include amino acid residue sequences 
from more than one adenovirus serotype. The nucleotide sequences encoding 
fiber protein or polypeptide need not be modified solely at one or both termini; 
fiber protein, may be modified "internally" as well as at the termini. 

15 Additional nucleic acid fragments can encode polypeptides that are added 

to the fiber protein. In one variation, the non-native amino acid residue 
sequence is coupled to the carboxyl terminus of the fiber. In another, the non- 
native amino acid residue sequence further includes a linker sequence. 
Alternatively, the fiber protein further comprises a ligand coupled to the linker. 

20 Suitable ligands include, but are not limited to, ligands that specifically or 

' selectively bind to a cell surface receptor and ligands that can be used to couple 
other proteins or nucleic acid molecules. Typically, the packaging cell lines will 
contain nucleic acid encoding the fiber protein or modified protein stably 
integrated into a chromosome or chromosomes in the cellular genome. 

25 The packaging cell line can be derived from a procaryotic cell line or from 

a eukaryotic cell line. While various embodiments suggest the use of mammalian 
cells, and more particularly, epithelial cell lines, a variety of other, non-epithelial 
cell lines are used in various embodiments. Thus, while various embodiments 
disclose the use of a cell line selected from among the 293, A549, W1 62, HeLa, 

30 Vero, 21 1, and 21 1 A cell lines, and any other cell lines suitable for such use are 
likewise contemplated herein. 

3. Components of the nucleic acid molecule included In the particle 
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A recombinant viral vector or therapeutic viral vector for use in the 
methods herein, typically includes a nucleic acid fragment that encodes a 
protein or polypeptide molecule, or a biologically active fragment thereof, or 
other regulatory sequence, that is intended for use in therapeutic applications. 
5 The nucleic acid molecule to be packaged in the viral particle also may 

include an enhancer element and/or a promoter located 3' or 5' to and 
controlling the expression of the therapeutic product-encoding nucleic acid 
molecule if the product is a protein. Further, for purposes herein, the promoter 
and/or other transcriptional and translational regulatory sequences controlling 

10 expression of the product is preferably one that is expressed specifically in the 
targeted cells, such as the a photoreceptor-specific promoter, such as a 
rhodopsin gene promoter. 

The nucleic acid molecule to be packaged in viral capsid includes at least 
2 different operatively linked DNA segments. The DNA can be manipulated and 

15 amplified by PGR as described herein and by using standard techniques, such as 
those described in Molecular Cloning: A Laboratory Manual, 2nd Ed., Sambrook 
et aL, eds., Cold Spring Harbor, New York (1989). Typically, to produce such 
molecule, the sequence encoding the selected polypeptide and the promoter or 
enhancer are operatively linked to a DNA molecule capable of autonomous 

20 replication in a cell either in vivo or in vitro. By operatively linking the enhancer 
element or promoter and nucleic acid molecule to the vector, the attached 
segments are replicated along with the vector sequences. 

Thus, the recombinant DNA molecule (rDNA) is a hybrid DNA molecule 
comprising at least 2 nucleotide sequences not normally found together in 

25 nature. In various preferred embodiments, one of the sequences is a sequence 
encoding an Ad-derived polypeptide, protein, or fragment thereof. The nucleic 
acid molecule intended to be packaged is from about 20 base pairs to about 
40,000 base pairs in length, preferably about 50 bp to about 38,000 bp in 
length. In various embodiments, the nucleic acid molecule is of sufficient length 

30 to encode one or more adenovirus proteins or functional polypeptide portions 
thereof. Since individual Ad polypeptides vary in length from about 1 9 amino 
acid residues to about 967 amino acid residues, encoding nucleic acid molecules 
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from about 50 bp up to about 3000 bp, depending on the number and size of 
individual polypeptide-encoding sequences that are "replaced" in the viral 
vectors by therapeutic product-encoding nucleic acid molecules. 

Preferably the molecule includes an adenovirus tripartite leader (TPL) 
5 nucleic acid sequence operatively linked to an intron containing RNA processing 
signals (such as for example, splice donor or splice acceptor sites) suitable for 
expression in the packaging cell line. Most preferably the intron contains a splice 
donor site and a splice acceptor site. Alternatively, the TPL nucleotide sequence 
may not comprise an intron. The intron includes any sequence of 

10 nucleotides that function in the packaging cell line to provide RNA processing 
signals, including splicing signals. Introns have been well characterized from a 
large number of structural genes, and include but are not limited to a native 
intron 1 from adenovirus, such as Ad5's TPL intron 1; others include the SV40 
VP intron; the rabbit beta-globin intron, and synthetic intron constructs (see, 

15 e. g. r Petitclerc eta/. (1995) J. Biothechnol., 40:169; and Choi et al. (19910 MoL 
Cell. Biol., 1 7:3070). 

The nucleic acid molecule encoding the TPL includes either (a) first and 
second TPL exons or (b) first, second and third TPL exons, where each TPL exon 
in the sequence is selected from among the complete TPL exon 1 , partial TPL 

20 exon 1, complete TPL exon 2 and complete TPL exon 3. A complete exon is 
one which contains the complete nucleic acid sequence based on the sequence 
found in the wild type viral genome. Preferably the TPL exons are from Ad2, 
Ad3, Ad5, Ad7 and the like, however, they may come from any Ad serotype, as 
described herein. A preferred partial TPL exon 1 is described in the Examples. 

25 The use of a TPL with a partial exon 1 has been reported (International PCT 
application No. WO 98/13499). 

The intron and the TPL exons can be operatively linked in a variety of 
configurations to provide a functional TPL nucleotide sequence. An intron may 
not be a part of the construct. For example, the intron can be positioned 

30 between any of TPL exons 1 , 2 or 3, and the exons can be in any order of first 
and second, or first/second/third. The intron can also be placed preceding the 
first TPL exon or following the last TPL exon. In a preferred embodiment, 
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complete TPL exon 1 is operatively linked to complete TPL exon 2 operatively 
linked to complete TPL exon 3. In a preferred variation, adenovirus TPL intron 1 
is positioned between complete TPL exon 1 and complete TPL exon 2. It may 
also be possible to use analogous translational regulators from other viral 
5 systems such as rabiesvirus. 

A preferred "complete" TPL nucleic acid molecule containing complete 
TPL exons 1 , 2 and 3 with adenovirus intron 1 inserted between exons 1 and 2 
has a nucleotide sequence shown in SEQ ID NO: 32. A preferred "partial" TPL 
nucleic acid molecule containing partial TPL exon 1 and complete TPL exons 2 
10 and 3 in that order has a nucleotide sequence shown in SEQ ID NO: 26. The 
construction of these preferred TPL nucleotide sequences is described in the 
Examples. 

Thus, preferred expression cassettes and complementing plasmids for 
expressing adenovirus structural genes, particularly fiber protein, contain an 
15 adenovirus TPL nucleotide sequence as described herein. 
4. Complementing Plasmids 

Also contemplated are the use of nucleic acid molecules, typically in the 
form of DNA plasmid vectors, which are capable of expression of an adenovirus 
structural protein or regulatory protein. Because these expression plasmids are 
20 used to complement the defective genes of a recombinant adenovirus vector 
genome, the plasmids are referred to as complementing or complementation 
plasmids. 

The complementing plasmid contains an expression cassette, a nucleotide 
sequence capable of expressing a protein product encoded by the nucleic acid 

25 molecule. Expression cassettes typically contain a promoter and a structural 

gene operatively linked to the promoter. The complementing plasmid can further 
include a sequence of nucleotides encoding TPL nucleotide to enhance 
expression of the structural gene product when used in the context of 
adenovirus genome replication and packaging. 

30 A complementing plasmid can include a promoter operatively linked to a 

sequence of nucleotides encoding an adenovirus structural polypeptide, such as, 
but are not limited to, penton base; hexon; fiber; polypeptide Ilia; polypeptide V; 
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polypeptide VI; polypeptide VII; polypeptide VIII; and biologically active 
fragments thereof. In another variation, a complementing plasmid may also 
include a sequence of nucleotides encoding a first adenovirus regulatory 
polypeptide, a second regulatory polypeptide, and/or a third regulatory 
5 polypeptide, and any combination of the foregoing. 

Plasmid pDV80 is a preferred plasmid herein. Other plasmids constructed 
in an analogous manner to encode modified fiber proteins and chimeric fiber 
proteins are also contemplated herein. 

5. Nucleic Acid Molecule Synthesis 

10 A nucleic acid molecule comprising synthetic oligonucleotides can be 

prepared using any suitable method, such as the phosphotriester or 
phosphodiester methods (see, e.g., Narang (1979) et al., Meth. EnzymoL, 
68:90; U.S. Patent No. 4,356,270; and Brown et aL, (1979) Meth. EnzymoL, 
68:109). For oligonucleotides, the synthesis of the family members can be 

15 conducted simultaneously in a single reaction vessel, or can be synthesized 

independently and later admixed in preselected molar ratios. For simultaneous 
synthesis, the nucleotide residues that are conserved at preselected positions of 
the sequence of the family member can be introduced in a chemical synthesis 
protocol simultaneously to the variants by the addition of a single preselected 

20 nucleotide precursor to the solid phase oligonucleotide reaction admixture when 
that position number of the oligonucleotide is being chemically added to the 
growing oligonucleotide polymer. The addition of nucleotide residues to those 
positions in the sequence that vary can be introduced simultaneously by the 
addition of amounts, preferably equimolar amounts, of multiple preselected 

25 nucleotide precursors to the solid phase oligonucleotide reaction admixture 
during chemical synthesis. For example, where all four possible natural 
nucleotides (A,T,G and C) are to be added at a preselected position, their 
precursors are added to the oligonucleotide synthesis reaction at that step to 
simultaneously form four variants (see, e.g., Ausubel et al. (Current Protocols in 

30 Molecular Biology, SuppL 8. p.2.1 1 .7, John Wiley & Sons, Inc., New York 
,1991). 
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Nucleotide bases other than the common four nucleotides (A,T,G or C), or 
the RNA equivalent nucleotide uracil (U), can also be used. For example, it is 
well known that inosine (I) is capable of hybridizing with A, T and G, but not C. 
Examples of other useful nucleotide analogs are known in the art and may be 
5 found referred to in 37 C.F.R. §1.822. 

Thus, where all four common nucleotides are to occupy a single position 
of a family of oligonucleotides, that is, where the preselected nucleotide 
sequence is designed to contain oligonucleotides that can hybridize to four 
sequences that vary at one position, several different oligonucleotide structures 

10 are contemplated. The composition can contain four members, where a 
preselected position contains A,T,G or C. Alternatively, a composition can 
contain two nucleotide sequence members, where a preselected position 
contains I or C, and has the capacity to hybridize at that position to all four 
possible common nucleotides. Finally, other nucleotides may be included at the 

1 5 preselected position that have the capacity to hybridize in a non-destabilizing 
manner with more than one of the common nucleotides in a manner similar to 
inosine. 

Similarly, larger nucleic acid molecules can be constructed in synthetic 
oligonucleotide pieces, and assembled by complementary hybridization and 
20 ligation, as is well known. 

D. Adenovirus Expression Vector Systems 

The adenovirus vector genome that is encapsulated in the virus particle 
and that expresses exogenous genes in a gene therapy setting is a key 
component of the system. Thus, the components of a recombinant adenovirus 
25 vector genome include the ability to express selected adenovirus structural 
genes, to express a desired exogenous protein, and to contain 

sufficient replication and packaging signals that the genome is 
packaged into a gene delivery vector particle. The preferred 
replication signal is an adenovirus inverted terminal repeat 
30 containing an adenovirus origin of replication, as is well known 

and described herein. 
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Although adenovirus include many proteins, not all adenovirus proteins 
are required for assembly of a recombinant adenovirus particle (vector). Thus, 
deletion of the appropriate genes from a recombinant Ad vector permits 
accommodation of even larger "foreign" DNA segments. 
5 A preferred recombinant adenovirus vector genome is "helper 

independent" so that genome can replicate and be packaged without the help of 
a second, complementing helper virus. Complementation is provided by a 
packaging cell. 

In a preferred embodiment, the adenovirus vector genome does not 
10 encode a functional adenovirus fiber protein. A non-functional fiber gene refers 
to a deletion, mutation or other modification to the adenovirus fiber gene such 
that the gene does not express any or insufficient adenovirus fiber protein to 
package a fiber-containing adenovirus particle without complementation of the 
fiber gene by a complementing plasmid or packaging cell line. Such a genome is 
15 referred to as a "fiberless" genome, not to be confused with a fiberless particle. 
Alternatively, a fiber protein may be encoded but is insufficiently expressed to 
result in a fiber containing particle. 

Thus, contemplated for use are helper-independent fiberless recombinant 
adenovirus vector genomes that include genes that (a) express all adenovirus 
20 structural gene products but express insufficient adenovirus fiber protein to 

packagie a fiber-containing adenovirus particle without complementation of said 
fiber gene, (b) express an exogenous protein, and (c) contain an adenovirus 
packaging signal and inverted terminal repeats containing adenovirus origin of 
replication. 

25 The adenovirus vector genome is propagated in the laboratory in the form 

of rDNA plasmids containing the genome, and upon introduction into an 
appropriate host, the viral genetic elements provide for viral genome replication 
and packaging rather than plasmid-based propagation. Exemplary methods for 
preparing an Ad-vector genome are described in the Examples. 

30 A vector herein includes a nucleic acid (preferably DNA) molecule capable 

of autonomous replication in a cell and to which a DNA segment, e.g., a gene or 
polynucleotide, can be operatively linked to bring about replication of the 
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attached segment. For purposes herein, one of the nucleotide segments to be 
operatively linked to vector sequences encodes at least a portion of a therapeutic 
nucleic acid molecule. As noted above, therapeutic nucleic acid molecules 
include those encoding proteins and also those that encode regulatory factors 
5 that can lead to expression or inhibition or alteration of expression of a gene 
product in a targeted cell. 

1 . Nucleic Acid Gene Expression Cassettes 
In various embodiments, a peptide-coding sequence of the therapeutic 
gene is inserted into an expression vector and expressed; however, it is also 

10 feasible to construct an expression vector which also includes some non-coding 
sequences as well. Preferably, however, non-coding sequences are excluded. 
Alternatively, a nucleotide sequence for a soluble form of a polypeptide may be 
utilized. Another preferred therapeutic viral vector includes a nucleotide 
sequence encoding at least a portion of a therapeutic nucleotide sequence 

15 operatively linked to the expression vector for expression of the coding sequence 
in the therapeutic nucleotide sequence. 

The choice of viral vector into which a therapeutic nucleic acid molecule 
is operatively linked depends directly, as is well known in the art, on the 
functional properties desired, e.g., vector replication and protein expression, and 

20 the host cell to be transformed - these being limitations inherent in the art of 
constructing recombinant DNA molecules. Although certain adenovirus 
serotypes are recited herein in the form of specific examples, it should be 
understood that the use of any adenovirus serotype, including hybrids and 
derivatives thereof are contemplated. 

25 A translatable nucleotide sequence is a linear series of nucleotides that 

provide an uninterrupted series of at least 8 codons that encode a polypeptide in 
one reading frame. Preferably, the nucleotide sequence is a DNA sequence. The 
vector itself may be of any suitable type, such as a viral vector (RNA or DNA), 
naked straight-chain or circular DNA, or a vesicle or envelope containing the 

30 nucleic acid material and any polypeptides that are to be inserted into the cell. 
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2- Promoters 

As noted elsewhere herein, an expression nucleic acid in an Ad-derived 
vector may also include a promoter, particularly a tissue or cell specific 
promoter, preferably one expressed in ocular cells, particularly photoreceptors. 
5 Promoters contemplaged for use herein include regulatable (inducible) as 

well as constitutive promoters, which may be used, either on separate vectors or 
on the same vector. Some useful regulatable promoters are those of the CREB- 
regulated gene family and include inhibin, gonadotropin, cytochrome c, 
glucagon, and the like. (See, e.g., International PCT application No. WO 

10 96/14061). Preferably the promoter selected is from a photoreceptor-specific 
gene, such as a rhodopsin gene or gene that encodes a protein that regulates 
rhodopsin expression. 
E. Formulation and administration 

Compositions containing therapeutically effective concentrations of 

15 recombinant adenovirus delivery vectors are provided. These are for delivery of 
therapeutic gene products to cells, particularly cells express a particular 50 kDa 
receptor or other receptor with which the vectors interact. These cells include 
cells of the eye and genital tract. Of particular interest are photoreceptor cells of 
the eye. Administration is effected by any means through which contacting with 

20 the photoreceptors is effected. Preferable modes of administration include, but 
are not limited to, subretinal injection, particularly intravitreal injection, to provide 
access to photoreceptor cells. 

The recombinant viral compositions may also be formulated for 
implantation into the anterior or posterior chamber of the eye, preferably the 

25 vitreous cavity, in sustained released formulations, such as those adsorbed to 
biodegradable supports, including collagen sponges, or in liposomes. Sustained 
release formulations may be formulated for multiple dosage administration, so 
that during a selected period of time, such as a month or up to about a year, 
several dosages are administered. Thus, for example, liposomes may be 

30 prepared such that a total of about two to up to about five or more times the 
single dosage is administered in one injection. 
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The vectors are formulated in an ophthalmologically acceptable carrier for 
intraocular, preferably intravitreal, administration in a volume of between about 
0.05 ml and 0.150 ml, preferably about 0.05 and 0.100 ml. 

The composition can be provided in a sealed sterile vial containing an 
5 amount of a compound of formula I, that upon intraocular administration will 
deliver a sufficient amount of viral particles to the photoreceptors in a volume of 
about 50 to 150 //I, containing at least about 10 7 , more preferably at least about 
10 8 plaque forming units in such volume. Typically, the vials will, thus, contain 
about 0.150 ml of the composition. 

10 To prepare compositions the viral particles are dialzyed into a suitable 

ophthalmologically acceptable carrier or viral particles, for example, may be 
concentrated and/or mixed therewith. The resulting mixture may be a solution, 
suspension or emulsion. In addition, the viral particles may be formulated as 
the sole pharmaceutical^ active ingredient in the composition or may be 

15 combined with other active agents for the particular disorder treated. 

For administration by intraocular injection or via eyedrops, suitable 
carriers include, but are not limited to, physiological saline, phosphate buffered 
saline (PBS), balanced salt solution (BSS),. lactate Ringers solution, and solutions 
containing thickening and solubilizing agents, such as glucose, polyethylene 

20 glycol, and polypropylene glycol and mixtures thereof. Liposomal suspensions 
may also be suitable as pharmaceutically acceptable carriers. These may be 
prepared according to methods known to those skilled in the art. Suitable 
ophthalmologically acceptable carriers are known. Solutions or mixtures 
intended for ophthalmic use may be formulated as 0.01% - 10% isotonic 

25 solutions, pH about 5-7, with appropriate salts [see, e.g. , U.S. Patent No. 
5,1 16,868, which describes typical compositions of ophthalmic irrigation 
solutions and solutions for local application]. Such solutions, which have a pH 
adjusted to about 7.4, contain, for example, 90-100 mM sodium chloride, 4-6 
mM dibasic potassium phosphate, 4-6 mM dibasic sodium phosphate, 8-12 mM 

30 sodium citrate, 0.5-1.5 mM magnesium chloride, 1.5-2.5 mM calcium chloride, 
15-25 mM sodium acetate, 10-20 mM D.L.-sodium £-hydroxybutyrate and 5-5.5 
mM glucose. 
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The compositions may be prepared with carriers that protect them from 
rapid elimination from the body, such as time release formulations or coatings. 
Such carriers include controlled release formulations, such as, but not limited to, 
microencapsulated delivery systems, and biodegradable, biocompatible polymers, 
5 such as ethylene vinyl acetate, polyanhydrides, polyglycolic acid, 

polyorthoesters, polylactic acid and other types of implants that may be placed 
directly into the anterior or posterior chamber or vitreous cavity of the eye. The 
compositions may also be administered in pellets, such as Elvax pellets 
(ethylene-vinyl acetate copolymer resin). 

10 Liposomal suspensions, including tissue-targeted liposomes, may also be 

suitable as pharmaceutically acceptable carriers. For example, liposome 
formulations may be prepared by methods known to those of skill in the art [see, 
e.g. , Kimm et aL (1 983) Bioch. Bioph. Acta 728 :339-398; Assil et aL (1 987) 
Arch Ophthalmol. 105 :400; and U.S. Patent No. 4,522,81 1]. The viral particles 

15 may be encapsulated into the aqueous phase of liposome systems. 

The active materials can also be mixed with other active materials, that 
do not impair the desired action, or with materials that supplement the desired 
action or have other action, including viscoelastic materials, such as hyaluronic 
acid, which is sold under the trademark HEALON, which is a solution of a high 

20 molecular weight (MW) of about 3 millions fraction of sodium hyaluronate 
[manufactured by Pharmacia, Inc; see, e.g. , U.S. Patent Nos. 5,292,362, 
5,282,851, 5,273,056, 5,229,127, 4,517,295 and 4,328,803], VISCOAT 
[fluorine-containing (meth)acrylates, such as, 1 H,1 H,2H,2H-hepta- 
decafluorodecylmethacrylate; see, e.g. , U.S. Patent Nos. 5,278,126, 5,273,751 

25 and 5,214,080; commercially available from Alcon Surgical, Inc.], ORCOLON 
[see, e.g. , U.S. Patent No. 5,273,056; commercially available from Optical 
Radiation Corporation], methylcellulose, methyl hyaluronate, polyacrylamide and 
polymethacrylamide [see, e.g. , U.S. Patent No. 5,273,751], The viscoelastic 
materials are present generally in amounts ranging from about 0.5 to 5.0%, 

30 preferably 1 to 3% by weight of the conjugate material and serve to coat and 
protect the treated tissues. The compositions may also include a dye, such as 
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methylene blue or other inert dye, so that the composition can be seen when 
injected into the eye. Additional active agents may be included. 

The compositions can be enclosed in ampules, disposable syringes or 
multiple or single dose vials made of glass, plastic or other suitable material. 
5 Such enclosed compositions can be provided in kits, In particular, kits 

containing vials, ampules or other containers, preferably disposable vials with 
sufficient amount of the composition to deliver about 0.100 ml thereof, and 
disposable needles, preferably self sealing 25-30 gauge needles, are provided 
herein. 

10 Finally, the compounds may be packaged as articles of manufacture 

containing packaging material, typically a vial, an ophthalmologically acceptable 
composition containing the viral particles and a label that indicates the 
therapeutic use of the composition. 

Also provided are kits for practice of the methods herein. The kits 

15 contain one or more containers, such as sealed vials, with sufficient composition 
for single dosage administration, and one or more needles, such as self sealing 
25-33 gauge needles, preferably 33 gauge or smaller needles, precisely 
calibrated syringes or other precisely calibrated delivery device, suitable for 
intravitreal injection. 

20 Administration of the composition is preferably by intraocular injection, 

although other modes of administration may be effective, if the sufficient 
amount of the compound achieves contact with the vitreous cavity. Intraocular 
injection may be effected by intravitreal injection, aqueous humor injection or 
injection into the external layers of the eye, such as subconjunctival injection or 

25 subtenon injection, or by topical application to the cornea, if a penetrating 
formulation is used. 

Administration 

The compositions containing the compounds are administered 
intraocularly or by other means, such as topically in the form of penetrating 
30 eyedrops, whereby contact of the recombinant vectors with the aqueous humor 
is effected. Intraocular administration may be effected by intravitreal injection, 
aqueous humor injection, injection into the external layers of the eye, such as 
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subconjunctival injection or subtenon injection, preferably in free form, but, 
alternatively, in liposomes or other sustained drug delivery device. 
Administration is preferably by intravitreal injection, preferably through self 
sealing 25-30 gauge needles or other suitably calibrated delivery device. 
5 Injection into the eye may be through the pars plana via the self-sealing needle. 

It is further understood that, for any particular subject, specific dosage 
regimens should be adjusted over time according to the individual need and the 
professional judgment of the person administering or supervising the 
administration of the recombinant viruses, and that the concentration ranges set 
10 forth herein are exemplary only and are not intended to limit the scope or 
practice of the claimed methods 
F. Diseases, Disorders and therapeutic products 
1 . Disease and disorders 

Retinitis pigmentosa 

1 5 Methods for specifically or selectively targeting recombinant adenovirus 

vectors for delivery of gene products, particularly therapeutic products are 
provided herein. These methods are particularly suitable for targeting cells that 
express receptors that are selectively recognized by Ad virus of subgroup D 
viruses, particularly Ad37. It is shown herein that these viruses selectively 

20 recognize receptors on cells, such as conjunctival cells and photoreceptors, that 
are not recognized by other adenoviruses. Hence, methods for targeting to 
these cell types by providing vectors that are packaged in viral particles that 
contain a sufficient portion of a fiber protein from one of these Ad serotypes to 
bind to these receptors. These methods are useful for targeting to 

25 photoreceptors and for treating ocular disorders, including, but are not limited to, 
inherited and acquired retinal, neovascular degenerative diseases (see table 
below). 

It is estimated that 1 in 3,500 individuals in the United States suffer from 
one of the pigmented retinopathies. This group of retinal diseases, commonly 
30 called retinitis pigmentosa, is characterized by progressive loss of peripheral and 
night vision. Patients may be affected at almost any age and it is not 
uncommon to experience symptoms in early childhood in certain inherited forms. 
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It has been shown that there are a variety of mutations in genes expressed in 
the photoreceptors, including genes in the rhodopsin gene and pathway that 
appear to be responsible for these diseases. In addition to mutations in 
rhodopsin, changes in the retinal pigmented epithelial (RPE) cells, also undergo 
5 degenerative changes and can form clumps of pigment that give rise to the 
characteristic pigmentary changes seen in patients with RP, 
Angiogenesis and ocular diseases and disorders 
The vast majority of diseases that cause catastrophic loss of vision do so 
as a result of ocular neovascularization; age related macular degeneration 

10 (ARMD) affects 12-15 million American over the age of 65 and causes visual 
loss in 10-15% of them as a direct effect of choroidal (sub-retinal) 
neovascularization. The leading cause of visual loss for Americans under the 
age of 65 is diabetes; 1 6 million individuals in the United States are diabetic and 
40,000 per year suffer from ocular complications of the disease, which often are 

15 a result of retinal neovascularization. Laser photocoagulation has been effective 
in preventing severe visual loss in subgroups of high risk diabetic patients, but 
the overall 10 year incidence of retinopathy remains essentially unchanged. For 
patients with choroidal neovascularization due to ARMD or inflammatory eye 
disease, such as ocular histoplasmosis, photocoagulation, with few exceptions, 

20 is ineffective in preventing visual loss. While recently developed, non- 
destructive photodynamic therapies hold promise for temporarily reducing 
individual loss in patients with previously untreatable choroidal 
neovascularization, only 61.4% of patients treated every 3-4 months had 
improved or stabilized vision compared to 45.9% of the placebo-treated group. 

25 In the normal adult, angiogenesis is tightly regulated and limited to wound 

healing, pregnancy and uterine cycling. Angiogenesis is turned on by specific 
angiogenic molecules such as basic and acidic fibroblast growth factor (FGF), 
vascular endothelial growth factor (VEGF), angiogenin, transforming growth 
factor (TGF), tumor necrosis factor-a (TNF-cr) and platelet derived growth factor 

30 (PDGF). Angiogenesis can be suppressed by inhibitory molecules such as 

interferon-a, thrombospondin-1 , angiostatin and endostatin. It is the balance of 
these naturally occurring stimulators and inhibitors that controls the normally 



WO 01/83729 



PCT/EP01/04863 



-41- 

quiescent capillary vasculature. When this balance is upset, as in certain disease 
states, capillary endothelial cells are induced to proliferate, migrate and 
ultimately differentiate. 

Angiogenesis plays a central role in a variety of diseases, including, but 
5 are not limited to, cancer and ocular neovascularization. Sustained growth and 
metastasis of a variety of tumors has also been shown to be dependent on the 
growth of new host blood vessels into the tumor in response to tumor derived 
angiogenic factors. Proliferation of new blood vessels in response to a variety of 
stimuli occurs as the dominant finding in the majority of eye diseases that blind, 

10 such as, but are not limited to, proliferative diabetic retinopathy (PDR), ARMD, 
rubeotic glaucoma, interstitial keratitis and retinopathy of prematurity. In these 
diseases, tissue damage can stimulate release of angiogenic factors resulting in 
capillary proliferation. VEGF plays a dominant role in iris neovascularization and 
neovascular retinopathies. While reports clearly show a correlation between 

15 intraocular VEGF levels and ischemic retinopathic ocular neovascularization, FGF 
likely plays a role. Basic and acidic FGF are known to be present in the normal 
adult retina, even though detectable levels are not consistently correlated with 
neovascularization. This may be largely due to the fact that FGF binds very 
tightly to charged components of the extracellular matrix and may not be readily 

20 available in a freely diffusible form that would be detected by standard assays of 
intraocular fluids. 

A final common pathway in the angiogenic response involves integrin- 
mediated information exchange between a proliferating vascular endothelial cell 
and the extracellular matrix. This class of adhesion receptors, called integrins, 

25 are expressed as heterodimers having an a and fi subunit on all cells. One such 
integrin, or v /? 3 , is the most promiscuous member of this family and allows 
endothelial cells to interact with a wide variety of extracellular matrix 
components. Peptide and antibody antagonists of this integrin inhibit 
angiogenesis by selectively inducing apoptosis of the proliferating vascular 

30 endothelial cells. Two cytokine-dependent pathways of angiogenesis exist and 
may be defined by their dependency on distinct vascular cell integrins, a v ff 3 and 
or v /? 5 . Specifically, basic FGF- and VEGF-induced angiogenesis depend on integrin 
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or v /? 3 and a v /? 5 , respectively, since antibody antagonists of each integrin 
selectively block one of these angiogenic pathways in the rabbit corneal and 
chick chorioallantoic membrane (CAM) models. Peptide antagonists that block 
all a v integrins inhibit FGF- and VEGF-stimulated angiogenesis. While normal 
5 human ocular blood vessels do not display either integrin, a v 0 3 and a v /? 5 integrins 
are selectively displayed on blood vessels in tissues from patients with active 
neovascular eye disease. While only a v 0 3 was consistently observed in tissue 
from patients with ARMD,a v /? 3 and a v 0 s were present in tissues from patients 
with PDR. Systemically administered peptide antagonists of integrins blocked 

10 new blood vessel formation in a mouse model of retinal vasculogenesis. 

In addition to adhesion events described above, cell migration through the 
extracellular matrix also depends on proteolysis. Matrix metalloproteinases are a 
family of zinc-requiring matrix-degrading enzymes that include the collagenases, 
gelatinases and stromelysins, all of which have been implicated in invasive cell 

15 behavior. Invasive cell processes such as tumor metastasis and angiogenesis 
have been found to be associated with the expression of integrins and MMP-2, 
MMP-2 are all found throughout the eye where they may interact to maintain a 
quiescent vasculature until the balance is upset, resulting in pathological 
angiogenesis. A non-catalytic C-terminal hemopexin-like domain of MMP-2 

20 (PEX) can block cell surface collagenolytic activity and inhibit angiogenesis in the 
CAM model by preventing localization of MMP-2 to the surface of invasive cells 
through interaction with the integrin agff 3 . 

Hence, anti-angiogenic agents have a role in treating retinal degeneration 
to prevent the damaging effects of these trophic and growth factors. 

25 Angiogenic agents, also have a role in promoting desirable vascularization to 
retard retinal degeneration by enhancing blood flow to cells. 

Members of adenovirus subgroup D, Ad8, 19A, and 37, are infectious 
agents that cause particularly severe cases of epidemic keratoconjunctivitis 
(EKC) (Arnberg et al. (1998) Virology 227:239-244; Curtis et al. (1998) 

30 J.Med.Microbiol. 47:91-94; Ritterband et al. (1998) Rev. Med. Virol. 5:187-201; 
and Takeuchi et al. (1999) J.CIin.Microbiol. 37:3392-3394). There is no 
effective treatment for this debilitating and contagious disease and EKC 
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continues to be a problem in ophthalmology clinics worldwide (Curtis et al. 
(1998) J.Med.Microbiol. 47:91-94, Lukashok et al. (1998) 

Curr.Clin.Top.lnfec.Dis. 75:286-304). Hence the vectors herein may be used for 
treating the disease. 

Table 3 

Candidate targets for ocular disease therapy 



CANDIDATE TARGETS FOR OCULAR DISEASE THERAPY 


Disease 


Candidate target(s) 


Retinitis pigmentosa 


Rhodopsin gene, and genes that 
regulate expression thereof 
rate/peripherin 


Stargardt's disease 


rim protein (ARC protein) 






Choroideremia 


rab geranylgeranyl transferase 
CHM, TCD, CHML* 


Gyrate Atrophy 


ornithine aminotransferase 


Macular dystrophy 


rcfe/peripherin 



* see, "MSR6-yeast homologue of the choroideraemia gene," Nature Genetics 
3: 193-4 (1993) 

TABLE 4 



Other Diseases 



Exudative Choroidal Diseases 



ICSC, fluorescein angiogram 



ICSC with large serious detachment of RPE (retinal pigmented epithelium) 



ICSC with bullous retinal detachment 



Macular drusen, exudative, confluent 



Drusen, sub-RPE choroidal neovascularization 



Drusen, notched serous detachment of RPE 
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Other Diseases 



Drusen, notched serous and hemorrhagic detachment of RPE 
Drusen, serous and hemorrhagic detachment of RPE and retina 
Drusen, organized RPE detachment causing bullous retinal detachment 
Drusen, geographic atrophy of RPE 

Drusen, exudative and cuticular, vitelliform macular detachment 

Drusen, cuticular, large vitelliform macular detachment 

North Carolina dystrophy with macular staphyloma 

North Carolina dystrophy with macular staphyloma 

Angioid streaks, pseudoxanthoma elasticum (PXE), CNVM 

Angioid streaks, PXE, large notched retinal detachment 

Myopic degeneration, Foerster-Fuchs spot 

Presumed ocular histoplasmosis syndrome (POHS) 

Submacular bacterial abscess 

Toxocara cam's, subretinal granuloma 

Serpiginous (geographic) choroiditis 

Posterior scleritis 

Harada's disease 

Posterior sympathetic uveitis 

Benign reactive lymphoid hyperplasia of uveal tract 
Choroidal ruptures and CNVM 
Cavernous hemangioma of choroid 
Choroidal osteoma 

Choroidal nevus, serous macular detachment 

Choroidal nevus with CNVM 

Diffuse sclerochoroidal melanocyte nevus 

Choroidal melanoma with serous detachment of RPE 

Metastatic lung carcinoma to choroid 

Sub-RPE reticulum cell sarcoma 
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Other Diseases 



RPE tear, idiopathic choroidal neovascularization 



Heredodystrophic Disorders Affecting RPE & Retina 



Best's vitelliforrn macular dystrophy 



Best's vitelliforrn macular dystrophy with CNVM 



Best's vitelliforrn macular dystrophy, multiple lesions 



Adult-onset vitelliforrn foveomacular dystrophy 



Pattern dystrophy simulating fundus flavimaculatus 



Stargardt's disease (fundus flavimaculatus) 



Asteroid macular dystrophy 



Sjogren-Larssen syndrome 



Oguchi's disease, light-adapted state 



Oguchi's disease, dark-adapted state 



Fundus albipunctatus 



Retinitis pigmentosa, cystoid macular edema 



Crystalline tapetoretinal dystrophy 



Choroideremia 



Goldmann-Favre syndrome 



Sex-linked juvenile retinoschisis 



Perivenous retinitis pigmentosa 



Retinal Vascular Disorders 



Retinal arteriovenous aneurysm 



Central retinal artery occlusion 



Cilioretinal artery obstruction 



Ischemic retinopathy in systemic lupus erythematosus 



Ischemic retinopathy in scleroderma 



Hemorrhagic detachment of internal limiting membrane, hypertensive 
retinopathy 



Acquired retinal arterial macroaneurysm 
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Other Diseases 



Cystoid macular edema, aphakic 



Cystoid macular edema, nicotinic acid maculopathy 



Congenital retinal telangiectasis 



Acquired bilateral juxtafoveal telangiectasis 



Acquired bilateral juxtafoveal obliterative telangiectasis 



Diabetic optic neuropathy 



X-ray radiation exudative retinopathy 



Sickle cell SC disease, macular hemorrhage 



Retinal arterial aneurysms, arteritis, neuroretinitis 



Branch retinal vein obstruction (BRVO) 



BRVO, exudative maculopathy 



BRVO, optic disc new vessels, photocoagulation 



Waldenstrom's macroglobulinemia 



Inflammatory Diseases of the Retina and Choroid 



Luetic retinal vasculitis 



Focal Candida retinal abscess 



Toxoplasmosis, atrophic chorioretinal scar 



Toxoplasmosis retinitis and macular detachment 



Toxoplasmosis scar, CNVM, macular detachment 



Diffuse unilateral subacute neuroretinitis, small worm 



Diffuse unilateral subacute neuroretinitis, large worm 

Cytomegalic inclusion disease, papillitis 

Acute posterior multifocal placoid pigment epitheliopathy 

Acute macular neuroretinitis 

Sarcoid retinitis 

Sarcoid papillitis 

Behcet's disease 

Vitiliginous (bird-shot) chorioretinitis 
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Other Diseases 



Multifocal choroiditis and panveitis (pseudo-POHS) 



Retinal and Pigment Epithelial Hamartomas 



Congenital grouped albinotic RPE spots 



Congenital hyperplasia of RPE 



Combined RPE and retinal hamartoma, juxtapapillary 



Combined RPE and retinal hamartoma, peripheral 



Cystic astrocytoma, juxtapapillary 



Astrocytoma, macula 



Astrocytoma, juxtapapillary 



Cavernous hemangioma of retina 



Juxtapapillary sessile retinal capillary hemangioma 



Juxtapapillary endophytic retinal capillary hemangioma 



Other Tumors of the Choroid 



Choroidal metastasis 



Choroidal osteoma 



Choroidal hemangioma 



Miscellaneous uveal tumors 



Intraocular Lymphoid Tumors 



The leukemias and lymphomas 



Tumors of the Vitreous 



Non-Hodgkins ("reticulum cell") lymphoma 



Tumor involvement of the vitreous cavity 



Macular Disease 



Age-related macular degeneration — atrophic form 



Exudative age-related macular degeneration 



Choroidal neovascular membrane in degenerative myopia 



Central serous retinopathy 



Macular hole 
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Other Diseases 



Macular dystrophies 



Retinal Vascular Disease 



Etiologic mechanisms in diabetic retinopathy 



Background diabetic retinopathy 



Proliferative diabetic retinopathy 



Retinal arterial obstructive disease 



Central retinal vein occlusion 



Retinal branch vein occlusion 



Pregnancy and retinal disease 



Pregnancy-induced hypertension 



Hypertension 



The rheumatic disease 



Parafoveal telangiectasis 



Coats disease 



Disseminated intravascular systemic coagulopathy and related vasculopathies 



Hemoglobinopathies 



Retinopathy of prematurity 



Acquired retinal macroaneurysms 



Eales disease 



Radiation retinopathy 



The ocular ischemic syndrome 



Inflammatory Disease 



Ocular toxoplasmosis 



Ocular toxocariasis 



Ocular cysticercosis 



Cytomegalovirus infections of the retina 



Retinal and ophthalmologic manifestations of AIDS 



Acute retinal necrosis syndrome 
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Other Diseases 



Endogenous fungal infections of the retina and choroid 



Pars planitis 



Syphilis and tuberculosis 



Diffuse unilateral subacute neuroretinitis 



Scleritis 



Birdshot retinochoroidopathy 



Punctate inner choroidopathy 



Sarcoidosis 



Acute multifocal placoid pigment epitheliopathy 



10 



Geographic helicoid peripapillary choroidopathy (GHPC): serpiginous 
choroiditis 



Sympathetic ophthalmia 



Vogt-Koyanigi-Harada syndrome (uveomeningitic syndrome) 



Ciliochoroidal (uveal) effusion 



15 



Reproduced from: Stereoscopic Atlas of Ocular Diseases Diagnosis and Treat- 
ment, 2nd Edition, J. Donald O. Gass, Vol. 1 & 2, C.V. Mosley Co. (1987); 
and Retina Vol. II, Editor, Stephen J. Ryan, Medical Retina, C.V. Moslay Co. 
(1989). 

20 2. Therapeutic products 

Therapeutic products include but are not limited to, wild-type genes that 
are defective in ocular disorders, such as rhodopsin, or fragments thereof 
sufficient to correct the genetic defect, trophic factors, including growth factors, 
inhibitors and agonists of trophic factors, anti-apoptosis factors and other 
25 products described herein or known to those of skill in the art to be useful for 
treatment of disorders of the eye or that can be treated by a product expressed 
by a photoreceptor. 



OCULAR GENE THERAPY STRATEGIES 


GENERAL DISEASE 


EXAMPLES 


STRATEGY 
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OCULAR GENE THERAPY STRATEGIES 


Hereditary retinal and 
macular degeneration 


• Retinitis pigmentosa 

• Stargardt's disease 

• Other macular 
dystrophies 


Growth factors (e.g., 
GDNF) 

anti-apoptotic factors 
(e.g., bcl2 gene) 
Stargardt Disease Gene 
(ABCR) r 


Neovascular 


• Diabetes 

• Choroidal 
neovascularization 


Anti-angiogenesis 
factors 


Anti-tumor 


Retinoblastoma 


Antiproliferant 


Glaucoma 


Nerve fiber layer 
atrophy 


Neuroprotective agent 



See Allikmets et al. (1997) Science 277:1805-1807. 



For example, for treatment of retinitis pigmentosa the adenovirus vector 
can deliver a wild-type rhodopsin gene or a growth factor or trophic factor, such 
as ciliary neurotrophic factor CNTF; for treatment of Stargardt's disease, the 
10 vector can deliver a wild type ABCR (also called STGD1) or a growth factor or 
anti-angiogenic agent; for diabetic retinopathies, retinal vascularization the 
vector can deliver growth factors, such as a TGF (TGF/?), to prevent 
degeneration. 

1 5 The following examples are included for illustrative purposes only and are 

not intended to limit the scope of the invention. 

EXAMPLE 1 

Preparation of Adenovirus Packaging Cell Lines 

Cell lines that are commonly used for growing adenovirus are useful as 
20 host cells for the preparation of adenovirus packaging cell lines. Preferred cells 
include 293 cells, an adenovirus-transformed human embryonic kidney cell line 
obtained from the ATCC, having Accession Number CRL 1573; HeLa, a human 
epithelial carcinoma cell line (ATCC Accession Number CCL-2); A549, a human 
lung carcinoma cell line (ATCC Accession Number CCL 1 889); and other 
25 epithelial-derived cell lines. As a result of the adenovirus transformation, the 
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293 cells contain the E1 early region regulatory gene. All cells were maintained 

in complete DMEM + 10% fetal calf serum unless otherwise noted. 

These cell lines allow the production and propagation of adenovirus-based 

gene delivery vectors that have deletions in preselected gene regions and that 

5 are obtained by cellular complementation of adenoviral genes. To provide the 

desired complementation of such deleted adenoviral genomes in order to 

generate a viral vector, plasmid vectors that contain preselected functional units 

have been designed. Such units include but are not limited to E1 early region, 

E4 and the viral fiber gene. The preparation of plasmids providing such 

10 complementation, thereby being "complementary plasmids or constructs/' that 

are stably inserted into host cell chromosomes are described below. 

A. Preparation of an E4-Expressing Plasmid for Complementation of 
E4-Gene-Deleted Adenoviruses 

The viral E4 regulatory region contains a single transcription unit that is 

15 alternately spliced to produce several different mRNA products. The 

E4-expressing plasmid prepared as described herein and used to transfect the 
293 cell line contains the entire E4 transcription unit. A DNA fragment 
extending from 1 75 nucleotides upstream of the E4 transcription start site 
including the natural E4 promoter to 1 53 nucleotides downstream of the E4 

20 polyadenylation signal including the natural E4 terminator signal, corresponding 
to nucleotides 32667-35780 of the adenovirus type 5 (hereinafter referred to as 
Ad5) genome as described in Chroboczek et al. {ViroL, 186:280-285 (1992), 
GenBank Accession Number M73260), was amplified from Ad5 genomic DNA, 
obtained from the ATCC, via the polymerase chain reaction (PCR). Sequences of 

25 the primers used were 5 , CGGTACACA GAATTC AGGAGACACAACTCC3 / 
(forward or 5' primer referred to as E4L) (SEQ ID NO: 1) and 
5'GCCT GGATCC GGGAAGTTACGTAACGTGGGAAAAC3 / (SEQ ID NO: 2) 
(backward or 3' primer referred to as E4R). To facilitate cloning of the PCR 
fragment, these oligonucleotides were designed to create new sites for the 

30 restriction enzymes EcoRI and BamHI, respectively, as indicated with underlined 
nucleotides. DNA was amplified via PCR using 30 cycles of 92 C for 1 minute. 
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50 C for 1 minute, and 72 C for 3 minutes resulting in amplified full-length E4 
gene products. 

The amplified DNA E4 products were then digested with EcoRI and 
BamHI for cloning into the compatible sites of pB!uescript/SK + by standard 
5 techniques to create the plasmid pBS/E4. A 2603 base pair (bp) cassette 
including the herpes simplex virus thymidine kinase promoter, the hygromycin 
resistance gene, and the thymidine kinase polyadenylation signal was excised 
from the plasmid pMEP4 (Invitrogen, San Diego, CA) by digestion with Fspl 
followed by addition of BamHI linkers (5'CGCGGATCCGCG3') (SEQ ID NO: 3) 
10 for subsequent digestion with BamHI to isolate the hygromycin-containing 
fragment. 

The isolated BamHI-modified fragment was then cloned into the BamHI 
site of pBS/E4 containing the E4 region to create the plasmid pE4/Hygro 
containing 8710 bp. The pE4/Hygro plasmid has been deposited with the ATCC 

15 under accession number 97739. The complete nucleotide sequence of 

pE4/Hygro is set forth in SEQ ID NO: 4. Position number 1 of the linearized 
vector corresponds to approximately the middle portion of the pBS/SK + 
backbone. The 5' and 3' ends of the E4 gene are located at respective 
nucleotide positions 3820 and 707 of SEQ ID NO: 4 while the 5' and 3' ends of 

20 the hygromycin insert are located at respective nucleotide positions 3830 and 

6470. In the clone that was selected for use, the E4 and hygromycin resistance 

genes were divergently transcribed. 

B. Preparation of a Fiber-Expressing Plasmid for Complementation of 
Fiber-Gene-Deleted Adenoviruses 

25 To prepare a fiber-encoding construct, primers were designed to amplify 

the fiber coding region from Ad5 genomic DNA with the addition of unique 
BamHI and Notl sites at the 5' and 3' ends of the fragment, respectively. The 
Ad5 nucleotide sequence is available with the GenBank Accession Number 
M18369. The 5' and 3' primers had the respective nucleotide sequences of 

30 5'ATGGGATCCAAGATGAAGCGCGCAAGACCG3' (SEQ ID NO: 5) and 

5 ' C AT A AC GCGGCCGCTTCTTT ATTCTTG G G C 3 ' {SEQ ID NO: 6), where the 
inserted BamHI and Notl sites are indicated by underlining. The 5' primer also 



WO 01/83729 



PCT/EP01/04863 



-53- 

contained a nucleotide substitution 3 nucleotides 5' of the second ATG codon (C 
to A) that is the initiation site. The nucleotide substitution was included so as to 
improve the consensus for initiation of fiber protein translation. 

The amplified DNA fragment was inserted into the BamHI and Notl sites 
5 of pcDNA3 (Invitrogen) to create the plasmid designated pCDNA3/Fiber having 
7148 bp. The parent plasmid contained the CMV promoter, the bovine growth 
hormone (BHG) terminator and the gene for conferring neomycin resistance. The 
viral sequence included in this construct corresponds to nucleotides 
31040-32791 of the Ad5 genome. 

10 The complete nucleotide sequence of pCDNA3/Fiber is listed in SEQ ID 

NO: 7 where the nucleotide position 1 corresponds to approximately the middle 
of the pcDNA3 vector sequence. The 5' and 3' ends of the fiber gene are 
located at respective nucleotide positions 916 with ATG and 2661 with TAA. 

To enhance expression of fiber protein by the constitutive CMV promoter 

15 provided by the pcDNA vector, a Bglll fragment containing the tripartite leader 
(TPL) of adenovirus type 5 was excised from pRD1 12a (Sheay et aL, 
BioTechniques, 15:856-862 (1993) and inserted into the BamHI site of 
pCDNA3/Fiber to create the plasmid pCLF having 7469 bp. The adenovirus 
tripartite leader sequence, present at the 5' end of all major late adenoviral 

20 mRNAs as described by Logan etaL, Proc.JNatl. Acad. ScL, USA, 81:3655- 

3659 (1984) and Berkner, BioTechniques, 5:616-629 (1988), also referred to as 
a "partial TPL" since it contains a partial exon 1 , shows correspondence with the 
Ad5 leader sequence having three spatially separated exons corresponding to 
nucleotide positions 6081-6089 (the 3' end of the first leader segment), 7111- 

25 7182 (the entire second leader segment), and 9644-9845 (the third leader 

segment and sequence downstream of that segment). The corresponding cDNA 
sequence of the partial tripartite leader sequence present in pCLF is included in 
SEQ ID NO: 8 bordered by BamHI/Bglll 5' and 3' sites at respective nucleotide 
positions 907-912 to 1228-1233. The nucleotide sequence of an isolated partial 

30 TPL is also listed separately as SEQ ID No. 22 with the noted 5' and 3' 

restriction sites and with the following nucleotide regions identified: 1-6 nt Bglll 
site; 1-18 nt polylinker; 19-27 nt last 9 nt of the first leader segment (exon 1); 
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28-99 nt second leader segment (exon 2); 100-187 nt third leader segment 
(exon 3); 188-301 nt contains the nt sequence immediately following the third 
leader in the genome with an unknown function; and 322-327 nt Bglll site. 
The pCLF plasmid has been deposited with the ATCC as described in 
5 Example 4. The complete nucleotide sequence of pCLF is listed in SEQ ID NO: 8 
where the nucleotide position 1 corresponds to approximately the middle of the 
pcDNA3 parent vector sequence. The 5' and 3' ends of the Ad5 fiber gene are 
located at respective nucleotide positions 1237-1239 with ATG and 2980-2982 
with TAA. 

10 C- Generation of an Adenovirus Packaging Cell Line Carrying Plasmids 

Encoding Functional E4 and Fiber Proteins 

The 293 cell line was selected for preparing the first adenovirus 

packaging line as it already contains the E1 gene as prepared by Graham eta/., 

J. Gen. Virol., 36:59-74 (1977) and as further characterized by Spector, Virol. , 

15 730:533-538 (1983). Before electroporation, 293 cells were grown in RPMI 

medium + 10% fetal calf serum. Four x 106 cells were electroporated with 20 
fjg each of pE4/Hygro DNA and pCLF DNA using a BioRad GenePulser and 
settings of 300 V, 25 jl/F. DNA for electroporation was prepared using the 
Qiagen system according to the manufacturer's instructions (Bio-Rad, Richmond, 

20 CA). 

Following electroporation, cells were split into fresh complete DMEM 4- 
10% fetal calf serum containing 200/yg/ml Hygromycin B (Sigma, St. Louis, 
MO). 

From expanded colonies, genomic DNA was isolated using the 
25 "MICROTURBOGEN" system (Invitrogen) according to manufacturer's 

instructions. The presence of integrated E4 DNA was assessed by PCR using 
the primer pair E4R and ORF6L (5'TGCTTAAGCGGCCGCGAAGGAGA AGTCC3') 
(SEQ ID NO: 9), the latter of which is a 5 # forward primer near adenovirus 5 
open reading frame 6. 
30 One clone, designated 21 1, was selected exhibiting altered growth 

properties relative to that seen in parent cell line 293. The 211 clone contained 
the product,indicating the presence of inserted DNA corresponding to most, if 
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not all, of the E4 fragment contained in the pE4/Hygro plasmid. The 21 1 cell 
line has been deposited with the ATCC as described in Example 4. This line was 
further evaluated by amplification using the primer pair E4L/E4R described 
above, and a product corresponding to the full-length E4 insert was detected. 
5 Genomic Southern blotting was performed on DNA restricted with EcoRI and 
BamHI. The E4 fragment was then detected at approximately one copy/genome 
compared to standards with the EcoRI/BamHI E4 fragment as cloned into pBS/E4 
for use as a labeled probe with the Genius system according to manufacturer's 
instructions (Boehringer Mannheim, Indianapolis, IN). In DNA from the 211 cell 

10 line, the labeled internal fragment pE4/Hygro hybridized with the isolated E4 

sequences. In addition, the probe hybridized to a larger fragment which may be 
the result of a second insertion event. 

Although the 21 1 cell line was not selected by neomycin resistance, thus 
indicating the absence of fiber gene, to confirm the lack of fiber gene, the 21 1 

1 5 cell line was analyzed for expression of fiber protein by indirect 

immunofluorescence with an anti-fiber polyclonal antibody and a FITC-labeled 
anti-rabbit IgG (KPL) as secondary. No immunoreactivity was detected. 
Therefore, to generate 21 1 clones containing recombinant fiber genes, the 211 
clone was expanded by growing in RPMI medium and subjected to additional 

20 electroporation with the fiber-encoding pCLF plasmid as described above. 

Following electroporation, cells were plated in DMEM + 10% fetal calf 
serum and colonies were selected with 200 //g/ml G418 (Gibco, Gaithersburg, 
MD). Positive cell lines remained hygromycin resistant. These candidate 
sublines of 21 1 were then screened for fiber protein expression by indirect 

25 immunofluorescence as described above. The three sublines screened, 211 A, 
21 1B and 21 1R, along with a number of other sublines, all exhibited nuclear 
staining qualitatively comparable to the positive control of 293 cells infected 
with AdRSV/?gal (1 pfu/cell) and stained 24 hours post-infection. 

Lines positive for nuclear staining in this assay were then subjected to 

30 Western blot analysis under denaturing conditions using the same antibody. 

Several lines in which the antibody detected a protein of the predicted molecular 
weight (62 kd for the Ad5 fiber protein) were selected for further study including 
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21 1 A, 21 1 B and 21 1R. The 21 1 A cell line has been deposited with ATCC as 

described in Example 4. 

Immunoprecipitation analysis using soluble nuclear extracts from these 

three cell lines and a seminatiye electrophoresis system demonstrated that the 

5 fiber protein expressed is in the functional trimeric form characteristic of the 

native fiber protein. The predicted molecular weight of a trimerized fiber is 186 

kd. Under denaturing conditions, the trimeric form was destroyed resulting in 

detectable fiber monomers. Those clones containing endogenous E1, newly 

expressed recombinant E4 and fiber proteins were selected for use in 

10 complementing adenovirus gene delivery vectors having the corresponding 

adenoviral genes deleted as described in Example 2. 

D. Preparation of an E1 -Expressing Plasmid for Complementation of 
E1 -Gene-Deleted Adenoviruses 

In order to prepare adenoviral packaging cell lines other than those based 

15 on the E1-gene containing 293 cell line as described in Example 1C above, 

plasmid vectors containing E1 alone or in various combinations with E4 and fiber 
genes are constructed as described below. 

The region of the adenovirus genome containing the E1a and E1b gene is 
amplified from viral genomic DNA by PCR as previously described. The primers 

20 used are E1L, the 5' or forward primer, and E1R, the 3' or backward primer, 
having the respective nucleotide sequences 5'CCG A GCTAGC 
GACTGAAAATGAG3' (SEQ ID NO: 10) and 5'CC TCTCGAG AGACAGC 
AAGACAC3' (SEQ ID NO: 11). The E1L and E1R primers include the respective 
restriction sites Nhel and Xhol as indicated by the underlines. The sites are used 

25 to clone the amplified E1 gene fragment into the Nhel/Xhol sites in pMAM 
commercially available from Clontech (Palo Alto, CA) to form the plasmid 
pDEX/EI having 11152 bp. 

The complete nucleotide sequence of pDEX/E1 is listed in SEQ ID NO: 12 
where the nucleotide position 1 corresponds to approximately 1 454 nucleotides 

30 from the 3' end of the pMAM backbone vector sequence. The pDEX/E1 plasmid 
includes nucleotides 552 to 4090 of the adenovirus genome positioned 
downstream (beginning at nucleotide position 1460 and ending at 4998 in the 
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pDEX/E1 plasmid) of the glucocorticoid-inducible mouse mammary tumor virus 

(MMTV) promoter of pMAM. The pMAM vector contains the E. coli gpt gene 

that allows stable transfectants to be isolated using hypoxanthine/amino- 

pterin/thymidine (HAT) selection. The pMAM backbone occupies nucleotide 

5 positions 1-1454 and 5005-1 1 152 of SEQ ID NO: 12. 

E. Generation of an Adenovirus Packaging Cell Line Carrying Plasmids 
Encoding Functional El, and Fiber Proteins 

To create separate adenovirus packaging cell lines equivalent to that of 

the 21 1 sublines, 21 1 A, 21 1 B and 21 1 R, as described in Example 1 C, 

10 alternative cell lines lacking adenoviral genomes are selected for transfection 
with the plasmid constructs as described below. Acceptable host cells include 
A549, Hela, Vero and the like cell lines as described in Example 1 . The selected 
cell line is transfected with the separate plasmids, pDEX/E1and pCLF, 
respectively for expressing E1, and fiber complementary proteins. Following 

15 transfection procedures as previously described, clones containing stable 

insertions of the two plasmids are isolated by selection with neomycin and HAT. 
Integration of full-length copy of the E1 gene is assessed by PCR amplification 
from genomic DNA using the primer set E1L/E1R , as described above. 
Functional insertion of the fiber gene is assayed by staining with the anti-fiber 

20 antibody as previously described. 

The resultant stably integrated cell line is then used as a packaging cell 

system to complement adenoviral gene delivery vectors having the 

corresponding adenoviral gene deletions as described in Example 2. 

F. Preparation of a Plasmid Containing Two or More Adenoviral Genes for 
25 Complementing Gene-Deleted Adenoviruses 

The methods described in the preceding Examples rely on the use of two 

plasmids, pE4/Hygro and pCLF, or, pCLF and pDEX/E1 for generating adenoviral 

cell packaging systems. In alternative embodiments, complementing plasmids 

containing two or more adenoviral genes for expressing of encoded proteins in 

30 various combinations are also prepared as described below. The resultant 

plasmids are then used in various cell systems with delivery plasmids having the 

corresponding adenoviral gene deletions. The selection of packaging cell, 

content of the delivery plasmids and content of the complementing plasmids for 
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use in generating recombinant adenovirus viral vectors thus depends on whether 
other adenoviral genes are deleted along with the adenoviral fiber gene, and, if 
so, which ones. 

1 . Preparation of a Complementing Plasmid Containing Fiber 
5 and El Adenoviral Genes 

A DNA fragment containing sequences for the CMV promoter, adenovirus 

tripartite leader, fiber gene and bovine growth hormone terminator is amplified 

from pCLF prepared in Example 1B using the forward primer 

5'GACGGATCGGGAGATCTCC3' (SEQ ID NO: 13), that anneals to the 

10 nucleotides 1-19 of the pCDNA3 vector backbone in pCLF, and the backward 
primer 5XCGCCTCAGAAGCCATAGAGCC3' {SEQ ID NO: 14) that anneals to 
nucleotides 1278-1257 of the pCDNA3 vector backbone. The fragment is 
amplified as previously described and then cloned into the pDEX/E1 plasmid, 
prepared in Example 1D. For cloning in the DNA fragment, the pDEX/E1 vector 

15 is first digested with Ndel, that cuts at a unique site in the pMAM vector 
backbone in pDEX/E1, then the ends are repaired by treatment with 
bacteriophage T4 polymerase and dNTPs. 

The resulting plasmid containing E1 and fiber genes, designated 
pE1 /Fiber, provides dexamethasone-inducible El function as described for 

20 DEX/E1 and expression of Ad5 fiber protein as described above. 

The complete nucleotide sequence of pE1 /Fiber is listed in SEQ ID NO: 15 
where the nucleotide position 1 corresponds to approximately 1459 nucleotides 
from the 3' end of the parent vector pMAM sequence. The 5' and 3' ends of 
the Ad5 E1 gene are located at respective nucleotide positions 1460 and 4998 

25 followed by pMAM backbone and then separated from the Ad5 fiber from pCLF 
by the filled-in blunt ended Ndel site. The 5' and 3' ends of the pCLF fiber gene 
fragment are located at respective nucleotide positions 10922-14223 containing 
elements as previously described for pCLF. 

The resultant pE1 /Fiber plasmid is then used to complement one or more 

30 delivery plasmids expressing E1 and fiber. 

The pE1 /Fiber construct is then used to transfect a selected host cell as 
described in Example 1 E to generate stable chromosomal insertions preformed as 
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previously described followed by selection on HAT medium. The stable cells are 

then used as packaging cells as described in Example 2. 

2. Preparation of a Complementing Piasmid Containing E4 and 
Fiber Adenoviral Genes 

5 Piasmid pCLF prepared as described in Example 1 B is partially digested 

with Bglll to cut only at the site in the pCDNA3 backbone. The pE4/Hygro 

piasmid prepared in Example 1A is digested with BamHI to produce a fragment 

containing E4. The E4 fragment is then inserted into the BamHI site of pCLF to 

form piasmid pE4/Fiber. The resultant piasmid provides expression of the fiber 

10 gene as described for pCLF and E4 function as described for pE4/Hygro. 

A schematic piasmid map of pE4/Fiber, having 10610 bp. The complete 
nucleotide sequence of pE4/Fiber is listed in SEQ ID NO: 16 where the 
nucleotide position 1 corresponds to approximately 14 bp from the 3' end of the 
parent vector pCDNA3 backbone sequence. The 5' and 3' ends of the Ad5 E4 

15 gene are located at respective nucleotide positions 21 and 3149 followed by 
fused Bglll/BamHI sites and pCDNA3 backbone including the CMV promoter 
again followed by Bglll/BamHI sites. The adenovirus leader sequence begins at 
nucleotide position 4051 and extends to 4366 followed by fused BamHI/Bglll 
sites and the 5' and 3' ends of the fiber gene located at respective nucleotide 

20 positions 4372 and 6124. 

Stable chromosonal insertions of pE4/Fiber in host cells are obtained as 
described above. 

EXAMPLE 2 

Preparation of Adenoviral Gene Delivery Vectors Using Adenoviral Packaging Cell 
25 Lines 

Adenoviral delivery vectors are prepared to separately lack the 
combinations of E1 /fiber and E4/fiber. Such vectors are more replication- 
defective than those previously in use due to the absence of multiple viral genes. 
A preferred adenoviral delivery vector is replication competent but only via a 
30 non-fiber means is one that only lacks the fiber gene but contains the remaining 
functional adenoviral regulatory and structural genes. Furthermore, these 
adenovirus delivery vectors have a higher capacity for insertion of foreign DNA. 
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A. Preparation of Adenoviral Gene Delivery Vectors Having Specific Gene 
Deletions and Methods of Use 

To construct the E1/ fiber deleted viral vector containing the LacZ 
reporter gene construct, two new plasmids were constructed. The plasmid pA 
5 ElB/?gal was constructed as follows. A DNA fragment containing the SV40 

regulatory sequences and E. coli /?-galactosidase gene was isolated from pSV/?gal 
(Promega) by digesting with Vspl, filling the overhanging ends by treatment with 
Klenow fragment of DNA polymerase I in the presence of dNTP's and digesting 
with Bam H1 . The resulting fragment was cloned into the EcoRV and BamHI 

10 sites in the polylinker of pA E1sp1B (Microbix Biosystems, Hamilton, Ontario) to 
form pA E1 B /?gal that therefore contained the left end of the adenovirus genome 
with the Ela region replaced by the LacZ cassette (nucleotides 6690 to 4151) of 
pSVfi gal. Plasmid DNA may be prepared by the alkaline lysis method as 
described by Birnboim and Doly, Nuc. Acids Res., 7:1513-1523 (1978) or by the 

15 Quiagen method according to the manufacturer's instruction, from transformed 
cells used to expand the plasmid DNA was then purified by CsCI-ethidium 
bromide density gradient centrifugation. Alternatively, plasmid DNAs may be 
purified from E. coli by standard methods known in the art (e.g. see Sambrook et 
a I.) 

20 The second plasmid (pDV44), prepared as described herein, is derived 

from pBHGIO, a vector prepared as described by Bett et a/., Proc. Natl. Acad. 
Sci. r USA, 91:8802-8806 (1994) (see, also International PCT application No. 
W0 95/00655) using methods well known to one of skill in the art. This vector 
is also commercially available from Microbix and and contains an Ad5 genome 

25 with the packaging signals at the left end deleted and the E3 region (nucleotides 
28133:30818) replaced by a linker with a unique site for the restriction enzyme 
Pad. An 1 1 .9 kb BamHI fragment, which contains the right end of the 
adenovirus genome, is isolated from pBHGIO and cloned into the BamHI site of 
pBS/SK( + ) to create plasmid p1 1.3 having approximately 14,658 bp. The p11.3 

30 plasmid was then digested with Pad and Sail to remove the fiber, E4, and 
inverted terminal repeat (ITR) sequences. 
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This fragment was replaced with a 3,4 kb fragment containing the ITR 
segments and the E4 gene which was generated by PGR amplification from 
pBHGIO using the following oligonucleotide sequences: 
5' TGTACACCG GATCCGGCGCACACC3' SEQ ID NO: 17; and 
5 5'CACAACGAGCTC AATTAATTAATTGCCACATCCTC3' SEQ ID NO: 18. 

These primers incorporated sites for Pad and BamHI. Cloning this fragment into 
the Pad and blunt ended Sail sites of the p1 1.3 backbone resulted in a 
substitution of the fused ITRs, E4 region and fiber gene present in pBHGIO, by 
the ITRs and E4 region alone. The resulting p1 1.3 plasmid containing the ITR 

10 and E4 regions, designated plasmid pDV43a, was then digested with BamHI. 
This BamHI fragment was then used to replace a BamHI fragment in pBHGIO 
thereby creating pDV44 in a pBHGIO backbone. 

In an alternative approach to preparing pDV44 with an additional 
subcloning step to facilitate the incorporation of restriction cloning sites, the 

15 following cloning procedure was performed. pDV44 as above was constructed 
by removing the fiber gene and some of the residual E3 sequences from pBHGIO 
(Microbix Biosystems). As above, to simplify manipulations, the 1 1 .9 kb BamHI 
fragment including the rightmost part of the Ad5 genome was removed from 
pBHGIO and inserted into pBS/SK. The resulting plasmid was termed p1 1.3. 

20 The 3.4 kb DNA fragment corresponding to the E4 region and both ITRs of 
adenovirus type 5 was amplified as described above from pBHGIO using the 
oligonucleotides listed above and subcloned into the vector pCR2.1 (Invitrogen) 
to create pDV42. This step is the additional cloning step to facilitate the 
incorporation of a Sail restriction site. pDV42 was then digested with Pad, 

25 which cuts at a unique site (bold type) in one of the PCR primers, and with Sail, 
which cuts at a unique site in the pCR2.1 polylinker. This fragment was used to 
replace the corresponding Pacl/Xhol fragment of p1 1 .3 (the pBS polylinker 
adjacent to the Ad DNA fragment contains a unique Xhol site), creating pDV43. 

30 A plasmid designated pDV44 was constructed by replacing the 1 1 .9 kb 

BamHI fragment of pBHGIO by the analogous BamHI fragment of pDV43. As 
generated in the first procedure, pDV44 therefore differs from pBHGIO by the 



WO 01/83729 



PCT/EP01/04863 



-62- 

deletion of Ad5 nucleotides 30819:32743 {residual E3 sequences and all but the 
3'-most 41 nucleotides of the fiber open reading frame). 

Thus, to summarize, the cloning procedures described above result in the 
production of a fiber-deleted Ad5 genomic plasmid (pDV44) that was 
5 constructed by removing the fiber gene and some of the residual E3 sequences 
from pBHGIO. pDV44 contains a wild-type E4 region, but only the last 41 
nucleotides of the fiber ORF (this sequence was retained to avoid affecting 
expression of the adjacent E4 transcription unit). Plasmids pBHGIO and pDV44 
contain unpackageable Ad5 genomes, and must be rescued by cotransfection 

10 and subsequent homologous recombination with DNA carrying functional 

packaging signals. In order to generate vectors marked with a reporter gene, 
either pDV44 or pBHGIO was cptransfected with pAEIBfcgal, which contains 
the left end of the Ad5 genome with an SV40-driven S-galactosidase reporter 
gene inserted in place of the E1 region. 

15 In general, and as described below, the method for virus production by 

recombination of plasmids followed by complementation in cell culture involves 
the isolation of recombinant viruses by cotransfection of any one of the 
adenovirus packaging cell systems prepared in Example 1, namely 211 A, 21 1B, 
21 1R, A549, Vero cells, and the like, with plasmids carrying sequences 

20 corresponding to viral gene delivery vectors. 

A selected cell line is plated in dishes and cotransfected with pDV44 and 
pAE1B/? gal using the calcium phosphate method as described by Bett et al., 
Proc. Natl. Acad. Sci., USA, 37:8802-8806 (1994). Recombination between the 
overlapping adenovirus sequences in the two plasmids leads to the creation of a 

25 full-length viral chromosome where pDV44 and pAE1B/? gal recombine to form a 
recombinant adenovirus vector having multiple deletions. The deletion of E1 and 
of the fiber gene from the viral chromosome is compensated for by the 
sequences integrated into the packaging cell genome, and infectious virus 
particles are produced. The plaques thus generated are isolated and stocks of 

30 the recombinant virus are produced by standard methods. 

Because of the fiber deletion, a pDV44-derived virus is 
replication-defective, cells in which it is grown must complement this defect. 
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The 21 1B cell line (a derivative of 293 cells which expresses the wild-type (wt) 
AD5 fiber and is equivalent to 21 1 A on deposit with ATCC as described in 
Example 4) was used for rescue and propagation of the virus described here. 
pDV44 and pAEIfcgal were cotransfected into 21 1B cells, and the monolayers 
5 were observed for evidence of cytopathic effect (CPE). Briefly, for virus 

construction, cells were transfected with the indicated plasmids using the Gibco 
Calcium Phosphate Transfection system according to the manufacturer's 
instructions and observed daily for evidence of CPE. 

One of a total of 58 transfected dishes showed evidence of spreading cell 

10 death at day 15. A crude freeze-thaw lysate was prepared from these cells and 
the resulting virus (termed Ad5.fcgal.AF) was plaque purified twice and then 
expanded. To prepare purified viral preparations, cells were infected with the 
indicated Ad and observed for completion of CPE. Briefly, at day zero, 21 1B 
cells were plated in DMEM plus 10% fetal calf serum at approximately 1 X 10 7 

15 cells/150 cm 2 flask or equivalent density. At day one, the medium was replaced 
with one half the original volume of fresh DMEM containing the indicated Ad, in 
this case Ad5.Sgal.AF, at approximately 100 particles/cell. At day two, an equal 
volume of medium was added to each flask and the cells were observed for CPE. 
Two to five days after infection, cells were collected and virus isolated by lysis 

20 via four rapid freeze-thaw cycles. Virus was then purified by centrifugation on 
preformed 1 5-40% CsCI gradients (1 1 1 ,000 x g for three hours at 4°C). The 
bands were harvested, dialyzed into storage buffer (10 mM Tris-pH 8.1, 0.9% 
NaCI, and 10% glycerol), aliquoted and stored at - 70°C. Purified Ad5.ISgal.AF 
virus particles containing human adenovirus Ad5.fSgal.AFgenome (described 

25 further below) have been deposited with the ATCC on January 1 5, 1 999 as 
further described in Example 4. 

For viral titering, as necessary in the below Examples, Ad preparations 
were titered by plaque assay on 21 1B cells. Cells were plated on 
polylysine-coated 6 well plates at 1 .5 x 10 6 cells/well. Duplicate dilutions of 

30 virus stock were added to the plates in 1 ml/well of complete DMEM. After a 
five hour incubation at 37 °C, virus was removed and the wells overlaid with 2 
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ml of 0.6% low-melting agarose in Medium 199 (Gibco). An additional 1 ml of 
overlay was added at five day intervals. 

As a control, the first-generation virus AdS.IS gal.wt, which is identical to 
Ad5.Bgal.AF except for the fiber deletion, was constructed by cotransfection of 
5 pBHGlO.and pAEIBSgal. In contrast to the low efficiency of recovery of the 
fiberless genome (1/58 dishes), all of 9 dishes cotransfected with pAE1B/?gal 
and pBHGIO produced virus. 

In another embodiment, a delivery plasmid is prepared that does not 
require the above-described recombination events to prepare a viral vector 
10 having a fiber gene deletion. In one embodiment, a single delivery plasmid 
containing all the adenoviral genome necessary for packaging but lacking the 
fiber gene is prepared from plasmid pFG140 containing full-length Ad5 that is 
commercially available from Microbix. The resultant delivery plasmid referred to 
as pFG140-f is then used with pCLF stably integrated cells as described above to 
1 5 prepare a viral vector lacking fiber. For genetic therapy, the fiber gene can be 
replaced with a therapeutic gene of interest for preparing a therapeutic delivery 
adenoviral vector. Methods for producing a fiberless vector with a complete TPL 
are described in Example 3. 

Vectors for the delivery of any desired gene and preferably a therapeutic 
20 gene are prepared by cloning the gene of interest into the multiple cloning sites 
in the polylinker of commercially available pAE1sp1B (Microbix Biosystems), in 
an analogous manner as performed for preparing pE1B£ gal as described above. 
The same cotransfection and recombination procedure is then followed as 
described herein to obtain viral gene delivery vectors as further discussed in later 
25 Examples. 

1 . Characterization of the Ad5./?gal.AF Genome 
To confirm that the vector genomes had the proper structures and that 
the fiber gene was absent from the Ad5.Sgal.AF chromosome, the DNA isolated 
from viral particles was analyzed. Briefly, purified viral DNA was obtained by 
30 adding 10 jul of 10 mg/ml proteinase K, 40 jj\ of 0.5 M EDTA and 50 jj\ of 10% 
SDS to 800 jjI of adenovirus-containing culture supernatant. The suspension 
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was then incubated at 55 °C for 60 minutes. The solution was then extracted 
once with 

400 //I of a 24:1 mixture of chloroform:isoamyl alchohol. The aqueous phase 
was then removed and precipitated with sodium acetate/ethanol. The pellet was 
5 washed once with 70% ethanol and lightly dried. The pellet was then 

suspended in 40 //I of 10 mM Tris-HCi, pH 8.0, 1 mM EDTA. Genomic DNA 
from Ad5.Bgal.wt and Ad5.Bgal.AF produced the expected restriction patterns 
following digestion with either EcoRI or with Ndel. Southern blotting, performed 
with standard methods, with labeled fiber DNA as a probe demonstrated the 

10 presence of fiber sequence in Ad5.Bgal.wt but not in Ad5.Bgal.AF DNA. As a 
positive control, the blot was stripped and reprobed with labeled E4 sequence. 
Fiber and E4 sequences were detected by using labeled inserts from pCLF and 
pE4/Hygro, respectively. E4 signal was readily detectable in both genomes at 
equal intensities. The complete nucleotide sequence of Ad5.Bgal.AF is presented 

15 in SEQ ID NO: 23 and is contained in the virus particle on deposit with ATCC. 

2. Characterization of the Fiberless Adenovirus Ad5./?gal.AF 
To verify that Ad5.Bgal.AF was fiber-defective, 293 cells (which are 
permissive for growth of E1 -deleted Ad vectors but do not express fiber) were 
infected with Ad5.Bgal.AF or with Ad5.Bgal.wt. Twenty-four hours post 

20 infection, the cells were stained with polyclonal antibodies directed either against 

fiber or against the penton base protein. Cells infected with either virus were 

stained by the anti-penton base antibody, while only cells infected with the 

Ad5.fcgal.wt control virus reacted with the anti-fiber antibody. This confirms 

that the fiber-deleted Ad mutant does not direct the synthesis of fiber protein. 

25 3. Growth of the Fiber-Deleted Ad5./?gal.AF Vector in 

Complementing Cells 

Ad5.BgaI.AF was found to readily be propagated in 21 1B cells. As 

assayed by protein concentration, CsCI-purified stocks of either Ad5.Bgal.AF or 

Ad5.Bgal.wt contained similar numbers of viral particles. The particles appeared 

30 to band normally on CsCI gradients. Infectivity of the Ad5.Bgal.AF particles was 

lower than the Ad5.Bgal.wt control, as indicated by an increased particle/PFU 

ratio. Ad5.Bgal.AF was also found to plaque more slowly than the control 
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virus. When plated on 21 1B cells, Ad5.Bgal.wt plaques appeared within 5-7 
days, while plaques of Ad5.ISgal.AF continued to appear until as much as 15-18 
days post infection. Despite their slower formation, the morphology of 
Ad5.Sgal.AF plaques was essentially normal. 
5 4. Production of Fiberless Ad5.ftgal.AF Particles 

As Ad5.Sgal.AF represents a true fiber null mutation and its stocks are 
free of helper virus, the fiber mutant phenotype was readily investigated. A 
single round of growth in cells (such as 293) which do not produce fiber 
generating a homogeneous preparation of fiberless Ad allowed for the 

10 determination of whether such particles would be stable and/or infectious. 
Either Ad5.Sgal.wt or Ad5.Sgal.AF was grown in 293 or 21 1B cells, and the 
resulting particles purified on CsCI gradients as previously described. 
Ad5.Ugal.AF particles were readily produced in 293 cells at approximately the 
same level as the control virus and behaved similarly on the gradients, indicating 

15 that there was not a gross defect in morphogenesis of fiberless capsids. 

Particles of either virus contained similar amounts of penton base 
regardless of the cell type in which they were grown. This demonstrated that 
fiber is not required for assembly of the penton base complex into virions. The 
Ad5.Kgal.AF particles produced in 293 cells did not contain fiber protein. 

20 21 IB-grown Ad5.Sgal.AF also contained less fiber than the Ad5.Sgai.wt control 
virus. The infectivities of the different viral preparations on epithelial cells 
correlated with the amount of fiber protein present. The fiberless Ad particles 
were several thousand-fold less infectious than the first-generation vector control 
on a per-particle basis, while infectivity of 21 IB-grown Ad5.Sgal.AF was only 

25 50-100 fold less than that of Ad5.Sgal.wt. These studies confirmed fiber's 

crucial role in infection of epithelial cells via CAR binding. 

5. Composition and Structure of the Fiberless Ad5.llgal.AF 
Particles 

The proteins contained in particles of 293-grown Ad5.Sgal.AF were 
30 compared to those in Ad5.Sgal.wt, to determine whether proteolysis or particle 
assembly was defective in this fiber null mutant. The overall pattern of proteins 
in the fiberless particles was observed to be quite similar to that of a 
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first-generation vector, with the exception of reduced intensity of the composite 
band resulting from proteins Ilia and IV (fiber). The fiberless particles also had a 
reduced level of protein VII. Although substantial amounts of uncleaved 
precursors to proteins VI, VII, and VIII were not seen, it is possible that the 
5 low-molecular weight bands migrating ahead of protein VII represent either 
aberrantly cleaved viral proteins or their breakdown products. 

Cryo-electron microscopy was used to more closely examine the structure 
of the 293 grown Ad5.fSgal.AF and of Ad5fSgal.wt. The fiber, having an 
extended stalk with a knob at the end, was faintly visible in favorable 

10 orientations of wild-type Ad5 particles, but not in images of the fiberless 

particles. Filamentous material likely corresponding to free viral DNA was seen 
in micrographs of fiberless particles. This material was also present in 
micrographs of the first-generation control virus, albeit at much lower levels. 
Three-dimensional image reconstructions of fiberless and wild-type 

15 particles at — 20 A resolution showed similar sizes and overall features, with the 
exception that fiberless particles lacked density corresponding to the fiber 
protein. The densities corresponding to other capsid proteins, including penton 
base and proteins Ilia, VI, and IX, were comparable in the two structures. This 
confirms that absence of fiber does not prevent assembly of these components 

20 into virions. The fiber was truncated in the wild-type structure as only the lower 
portion of its flexible shaft follows icosahedral symmetry. The RGD protrusions 
on the fiberless penton base were angled slightly inward relative to those of the 
wild-type structure. Another difference between the two penton base proteins 
was that there is a —30 A diameter depression in the fiberless penton base 

25 around the five-fold axis where the fiber would normally sit. The Ad5 

reconstructions confirm that capsid assembly, including addition of penton base 

to the vertices, is able to proceed in the complete absence of fiber. 

6. Integrin-Dependent Infectivity of Fiberless Ad5.ISgal.AF 
Particles 

30 While attachment via the viral fiber protein is a critical step in the 

infection of epithelial cells, an alternative pathway for infection of certain 
hematopoietic cells has been described. In this case, penton base mediates 
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binding to the cells (via fS2 integrins) and internalization (through interaction with 
av integrins). Particles lacking fiber might therefore be expected to be 
competent for infection of these cells, even though on a per-particle basis they 
are several thousand-fold less infectious than normal Ad vectors on epithelial 
5 cells. 

To investigate this, THP-1 monocytic cells were infected with 
Ad5.fSgaI.wt or with Ad5.fcgaI.AF grown in the absence of fiber. Infection of 
THP-1 cells was assayed by infecting 2 x 10 5 cells at the indicated m.o.i. in 0.5 
ml of complete RPMI. Forty-eight hours post-infection, the cells were fixed with 

10 glutaraldehyde and stained with X-gal, and the percentage of stained cells was 
determined by light microscopy. The results of the infection assay showed that 
the fiberless particles were only a few-fold less infectious than first-generation 
Ad on THP-1 cells. Large differences were seen in plaquing efficiency on 
epithelial (21 1B) cells. Infection of THP-1 cells by either Ad5.fcgal.AF or 

15 Ad5.Bgal.wt was not blocked by an excess of soluble recombinant fiber protein, 
but could be inhibited by the addition of recombinant penton base). These 
results indicate that the fiberless Ad particles use a fiber-independent pathway to 
infect these cells. Furthermore, the lack of fiber protein did not prevent 
Ad5.fcgalAF from internalizing into the cells and delivering its genome to the 

20 nucleus, demonstrating that fiberless particles are properly assembled and are 
capable of uncoating. 

The foregoing results with the recombinant viruses thus produced 
indicates that they can be used as gene delivery tools in cultured cells and in 
vivo as described more fully in the Examples. For example, for studies of the 

25 effectiveness and relative immunogenicity of multiply-deleted vectors, virus 

particles are produced by growth in the packaging lines described in Example 1 
and are purified by CsCI gradient centrifugation. Following titering, virus 
particles are administered to mice via systemic or local injection or by aerosol 
delivery to lung. The LacZ reporter gene allows the number and type of cells 

30 which are successfully transduced to be evaluated. The duration of transgene 
expression is evaluated in order to determine the long-term effectiveness of 
treatment with multiply-deleted recombinant adenoviruses relative to the 
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standard technologies which have been used in clinical trials to date. The 

immune response to the improved vectors described here is determined by 

assessing parameters such as inflammation, production of cytotoxic T 

lymphocytes directed against the vector, and the nature and magnitude of the 

5 antibody response directed against viral proteins. 

Versions of the vectors which contain therapeutic genes such as CFTR 

for treatment of cystic fibrosis or tumor suppressor genes for cancer treatment 

are evaluated in the animal system for safety and efficiency of gene transfer and 

expression. Following this evaluation, they are used as experimental therapeutic 

10 agents in human clinical trials. 

B. Retargeting of Adenoviral Gene Delivery Vectors by Producing 
Viral Particles Containing Different or Altered Fiber Proteins 

As the specificity of adenovirus binding to target cells is largely 

determined by the fiber protein, viral particles that incorporate modified fiber 

1 5 proteins or fiber proteins from different adenoviral serotypes (pseudotyped 
vectors) have different specificities. Thus, the methods of expression of the 
native Ad5 fiber protein in adenovirus packaging cells as described above is also 
applicable to production of different fiber proteins. 

Chimeric fiber proteins can be produced according to known methods 

20 (see, e.g., Stevenson et a/. (1995) J. Virol. , 63:2850-2857). Determinants for 
fiber receptor binding activity are located in the head domain of the fiber and an 
isolated head domain is capable of trimerization and binding to cellular receptors. 
The head domains of adenovirus type 3 (Ad3) and Ad5 were exchanged in order 
to produce chimeric fiber proteins. Similar constructs for encoding chimeric fiber 

25 proteins for use in the methods herein are contemplated. Thus, instead of the 
using the intact Ad5 fiber-encoding construct prepared in above and in U.S. 
application Serial No. 09/482,682) as a complementing viral vector in adenoviral 
packaging cells, the constructs described herein are used to transfect cells along 
with E4 and/or E1 -encoding constructs. 

30 Briefly, full-length Ad5 and Ad3 fiber genes were amplified from purified 

adenovirus genomic DNA as a template. The Ad5 and Ad3 nucleotide 
sequences are available with the respective GenBank Accession Numbers 
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M18369 and M1241 1 . Oligonucleotide primers are designed to amplify the 
entire coding sequence of the full-length fiber genes, starting from the start 
codon, ATG, and ending with the termination codon TAA. For cloning purposes, 
the 5' and 3' primers contain the respective restriction sites BamHI and Notl for 
5 cloning into pcDNA plasmid as described in Example 1 A. PCR is performed as 
described above. 

The resulting products are then used to construct chimeric fiber 
constructs by PCR gene overlap extension (Horton et al. (1990) BioTechniques, 
5:525-535). The Ad5 fiber tail and shaft regions (5TS; the nucleotide region 

10 encoding amino acid residue positions 1 to 403) are connected to the Ad3 fiber 
head region (3H; the nucleotide region encoding amino acid residue positions 
136 to 319) to form the 5TS3H fiber chimera. Conversely, the Ad3 fiber tail and 
shaft regions (3TS; the nucleotide region encoding amino acid residues positions 
1 to 135) are connected to the Ad5 fiber head region (5H; the nucleotide region 

15 encoding the amino acid residue positions 404 to 581) to form the 3TS5H fiber 
chimera. The fusions are made at the conserved TLWT (SEQ ID NO: 19) 
sequence at the fiber shaft-head junction. 

The resultant chimeric fiber PCR products are then digested with BamHI 
and Notl for separate directional ligation into a similarly digested pcDNA 3.1 . 

20 The TPL sequence is then subcloned into the BamHI as described in Example 1 A 
for preparing an expression vector for subsequent transfection into 211 cells as 
described above or into the alternative packaging cell systems as previously 
described. The resultant chimeric fiber construct-containing adenoviral 
packaging cell lines are then used to complement adenoviral delivery vectors as 

25 previously described. Other fiber chimeric constructs are obtained with the 
various adenovirus serotypes using a similar approach. 

In an alternative embodiment, the use of modified proteins including with 
modified epitopes (see, e.g., Michael et al. (1995) Gene Therapy, 2:660-668 
and International PCT application Publication No. WO 95/2641 2, which describe 

30 the construction of a cell-type specific therapeutic viral vector having a new 

binding specificity incorporated into the virus concurrent with the destruction of 
the endogenous viral binding specificity). In particular, the authors described the 
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production of an adenoviral vector encoding a gastrin releasing peptide (GRP) at 
the 3' end of the coding sequence of the Ad5 fiber gene. The resulting fiber- 
GRP fusion protein was expressed and shown to assemble functional fiber 
trimers that were correctly transported to the nucleus of HeLa cells following 
5 synthesis. 

Similar constructs are contemplated for use in the complementing 
adenoviral packaging cell systems for generating new adenoviral gene delivery 
vectors that are targetable, replication-deficient and less immunogenic. 
Heterologous ligands contemplated for use herein to redirect fiber specificity 

10 range from as few as 10 amino acids in size to large globular structures, some of 
which necessitate the addition of a spacer region so as to reduce or preclude 
steric hindrance of the heterologous ligand with the fiber or prevent trimerization 
of the fiber protein. The ligands are inserted at the end or within the linker 
region. Preferred ligands include those that target specific cell receptors or 

15 those that are used for coupling to other moieties such as biotin and avidin. 
A preferred spacer includes a short 1 2 amino acid peptide linker 
composed of a series of serines and alanine flanked by a proline residue at each 
end using routine procedures known to those of skill in the art. The skilled 
artisan will be with the preparation of linkers to accomplish sufficient protein 

20 presentation and to alter the binding specificity of the fiber protein without 
compromising the cellular events that follow viral internalization. Moreover, 
within the context of this disclosure, preparation of modified fibers having 
ligands positioned internally within the fiber protein and at the carboxy terminus 
as described below are contemplated for use with the methods described herein. 

25 

The preparation of a fiber having a heterologous binding ligand is prepared 
essentially as described in the above-cited paper. Briefly, for the ligand of 
choice, site-directed mutagenesis is used to insert the coding sequence for a 
linker into the 3' end of the Ad5 fiber construct in pCLF as prepared in Example 
30 1. 

The 3' or antisense or mutagenic oligonucleotide encodes a preferred 
linker sequence of ProSerAlaSerAlaSerAlaSerAlaProGlySer (SEQ ID NO: 20) 
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followed by a unique restriction site and two stop codons, respectively, to allow 
the insertion of a coding sequence for a selected heterologous ligand and to 
ensure proper translation termination. Flanking this linker sequence, the 
mutagenic oligonucloetide contains sequences that overlap with the vector 
5 sequence and allow its incorporation into the construct. Following mutagenesis 
of the pCLF sequence adding the linker and stop codon sequences, a nucleotide 
sequence encoding a preselected ligand is obtained, linkers corresponding to the 
unique restriction site in the modified construct are attached and then the 
sequence is cloned into linearized corresponding restriction site. The 

10 resultant fiber-ligand construct is then used to transfect 21 1 or the alternative 
cell packaging systems previously described to produce complementing viral 
vector packaging systems. 

In a further embodiment, intact fiber genes from different Ad serotypes 
are expressed by 21 1 cells or an alternative packaging system as previously 

15 described. A gene encoding the fiber protein of interest is first cloned to create 
a plasmid analogous to pCLF, and stable cell lines producing the fiber protein are 
generated as described above for Ad5 fiber. The adenovirus vector described 
which lacks the fiber gene is then propagated in the cell line producing the fiber 
protein relevant for the purpose at hand. As the only fiber gene present is the 

20 one in the packaging cells, the adenoviruses produced contain only the fiber 
protein of interest and therefore have the binding specificity conferred by the 
complementing protein. Such viral particles are used in studies such as those 
described above to determine their properties in experimental animal systems. 

EXAMPLE 3 

25 Tripartite leader sequences (TPLs) that are useful in enhancing the 

expression of complementing adenoviral proteins, particularly fiber protein, for 
use in preparing an adenoviral gene delivery vector are provided. The complete 
Ad5 TPL was constructed by assembling PCR fragments. First, the third TPL 
exon (exon 3) (nt 9644-9731 of the Ad5 genome) was amplified from Ad5 

30 genomic DNA using the synthetic oligonucleotide primers 

5'CTCAACAATTGTGGATCCGTACTCC3'(SEQ ID No. 24) and 
5'GTGCTCAGCAGATCTTGCGACTGTG3' (SEQ ID No. 25). The resulting 
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product was cloned to the BamHI and Bglll sites of pAE1Sp1a (Microbix 
Biosystems) using sites in the primers (shown in bold) to create plasmid pDV52. 
A fragment corresponding to the first TPL exon (exon 1), the natural first intron 
(intron 1), and the second TPL exon (exon 2) (Ad5 nt 6049-7182) was then 
5 amplified using primers 5'GGCGCGTTCGGATCCACTCTCTTCC3' (SEQ ID No. 
26) and 5'CTACATGCTAGGCAGATCTCGTTCGGAG3' (SEQ ID No. 27),and 
cloned into the BamHI site of pDV52 (again using sites in the primers) to create 
pDV55. 

This plasmid contains a 1 .2 kb BamHI/Bglll fragment containing the first 
10 TPL exon, the natural first intron, and the fused second and third TPL exons. 
The nucleotide sequence of the complete TPL containing the noted 5' and 3' 
restriction sites is shown in SEQ ID No 28 with the following nucleotide regions 
identified: 1-6 nt BamHI site; 7-47 nt first leader segment (exon 1); 48-1068 nt 
natural first intron (intron 1); 1069-1 140 nt second leader segment (exon 2); 
15 1 141-1 146 nt fused BamHI and Bglll sites; 1 147-1 234 nt third leader segment 
(exon 3); and 1235-1240 nt Bglll site. 

EXAMPLE 4 

Deposit of Materials 

The following cell lines and plasmids were deposited on September 25, 

20 1996, with the American Type Culture Collection, 10801 University Blvd, 

Manassas, Virginia, USA (ATCC) under the provisions of the Budapest Treaty on 
the International Recognition of the Deposit of Microorganisms for the Purpose 
of Patent Procedure and the Regulations thereunder (Budapest Treaty): Plasmid 
pE4/Hygro (accession number 97739), Plasmid pCLF (accession number 97737), 

25 21 1 Cell Line (accession number CRL-12193) and 21 1 A Cell Line (accession 
number CRL-12194) 

The following virus, Ad5.iffgal.AF, was deposited on January 15, 1999, 
with the ATCC as listed above and provided with accession number VR2636. 

Additionally, plasmids pDV60, pDV67, pDV69, pDV80 and pDV90 were 

30 deposited at the ATCC on January 5, 2000 and provided with accession 
numbers PTA-1 144, PTA-1 145, PTA-1 146, PTA-1 147 and PTA-1 148 
respectively. 
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EXAMPLE 5 

Preparation and Use of Adenoviral Packaging Cell Lines 
Containing Plasmids Containing Alternative TPLs 

Plasmids containing tripartite leaders (TPLs) have been constructed. The 

5 resulting plasmids that contain different selectable markers, such as neomycin 

and zeocin, were then used to prepare fiber-complementing stable cell lines for 

use as for preparing adenoviral vectors. 

A. pDV60 

Plasmid pDV60 was constructed by inserting this TPL cassette of SEQ ID 
10 No. 28 into the BamHI site upstream of the Ad5 fiber gene in pcDNA3/Fiber, a 
neomycin selectable plasmid (see, e.g., U.S. application Serial No. 09/482,682 
(also filed as International PCT application No. PCT/USOO/00265 on January 14, 
2000); see also Von Seggern et al. 

(1998) J. Gen Virol., 79: 1461-1468). The nucleotide sequence of pDV60 is 
15 listed in SEQ ID NO: 29. Plasmid pDV60 has been deposited in the ATCC under 
accession number PTA-1 144. 

B. pDV61 

To construct pDV61, an Asp718/Notl fragment containing the CMV 
promoter, partial Ad5 TPL, wildtype Ad5 fiber gene, and bovine growth hormone 

20 terminator was transferred from pCLF (ATCC accession number 97737; and 
described in copending U.S. application Serial No. 09/482,682 (also filed as 
International PCT application No. PCT/USOO/00265 on January 14, 2000);), to a 
zeocin selectable cloning vector referred to as pCDNA3.1/Zeo ( + ) (commerically 
available from Invitrogen and for which the sequence is known). 

25 C. pDV67 

In an analogous process, pDV67 containing complete TPL was 
constructed by transferring an Asp 718/Xbal fragment from pDV60 into 
pcDNA3.1/Zeo( + ) backbone. The nucleotide sequence of pDV67 is set forth in 
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SEQ ID No. 30. Plasmid pDV67 is available from the ATCC under accession 
number PTA-1 145. 

D. pDV69 

To prepare pDV69 containing a modified fiber protein, the chimeric 
5 Ad3/Ad5 fiber gene was amplified from pGEM5TS3H (Stevenson et al. (1995) J. 
Virol., 69: 2850-2857) using the primers 5'ATGGGAT 
CAAGATGAAGCGCGCAAGACCG3' (SEQ ID No. 31) and 

5'CACTATAGCGGCCGCATTCTCAGTCATCTT3' (SEQ ID No. 32), and cloned to 
the BamHI and Notl sites of pcDNA3.1/Zeo( + ) via new BamHI and Notl sites 
10 engineered into the primers to create pDV68. Finally, the complete TPL 

fragment described above was then added to the unique BamHI site of pDV68 
to create pDV69. The nucleotide sequence of pDV69 is listed in SEQ ID No. 33 
and has been deposited in the ATCC under accession number PTA-1 146. 

E. Preparation of Stable Adenovirus Packaging Cell Lines 

15 E1-2a S8 cells are derivatives of the A549 lung carcinoma line (ATCC # 

CCL 185) with chromosomal insertions of the plasmids pGRE5-2.E1 (also 
referred to as GRE5-E1-SV40-Hygro construct and listed in SEQ ID No. 34) and 
pMNeoE2a-3.1 (also referred to as MMTV-E2a-SV40-Neo construct and listed in 
SEQ ID No. 35), which provide complementation of the adenoviral E1 and E2a 

20 functions, respectively. This line and its derivatives were grown in Richter's 
modified medium (BioWhitaker) + 10% FCS. E1-2a S8 cells were 
electroporated as previously described (Von Seggern et al. (1998) J. Gen Virol. , 
79: 1461-1468) with pDV61, pDV67, or with pDV69, and stable lines were 
selected with zeocin (600 //g/ml). 

25 The cell line generated with pDV61 is designated 601. The cell line 

generated with pDV67 is designated 633 while that generated with pDV69 is 
designated 644. Candidate clones were evaluated by immunofluorescent 
staining with a polyclonal antibody raised against the Ad2 fiber. Lines 
expressing the highest level of fiber protein were further characterized. 

30 For the S8 cell complementing cell lines, to induce E1 expression, 0.3 jjM 

of dexamethasone was added to cell cultures 1 6-24 hours prior to challenge 
with virus for optimal growth kinetics. For preparing viral plaques, 5 X 10 5 
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celis/well in 6 well plates are prepared and pre-induced with the same 
concentration of dexamethasone the day prior to infection with 0.5 jjM included 
at a final concentration in the agar overlay after infection. 
F. Development of Cell Lines for Complementation of E17E2a" Vectors 
5 The Adenovirus 5 genome was digested with Seal enzyme, separated on 

an agarose gel, and the 6,095 bp fragment containing the left end of the virus 
genome was isolated. The complete Adenovirus 5 genome is registered as 
Genbank accession #M73260, incorporated herein by reference, and the virus is 
available from the American Type Culture Collection, Manassas, Virginia, U.S.A., 

10 under accession number VR-5. The Seal 6,095 bp fragment was digested 

further with Clal at bp 917 and Bglll at bp 3,328. The resulting 2,41 1 bp Clal to 
Bglll fragment was purified from an agarose gel and ligated into the superlinker 
shuttle plasmid pSE280 {Invitrogen, San Diego, CA), which was digested with 
Clal and Bglll, to form pSE280-E. 

15 Polymerase chain reaction (PCR) was performed to synthesize DNA 

encoding an Xhol and Sail restriction site contiguous with Adenovirus 5 DNA bp 
552 through 924. The primers which were employed were as follows: 
5' end, Ad5 bp 552-585: 

5'-GTCACTCGAGGACTCGGTC-GACTGAAAATGAGACATATTATCTGCCACGGA 
20 CC-3' {SEQ ID No 36) 

3' end, Ad5 bp 922-891: 

5'-CGAGATCGATCACCTCCGGTACAAGGTTTGGCATAG-3' (SEQ ID No. 37) 

This amplified DNA fragment (sometimes hereinafter referred to as 
Fragment A) then was digested with Xhol and Clal, which cleaves at the native 
25 Clal site (bp 917), and ligated to the Xhol and Clal sites of pSE280-E, thus 

reconstituting the 5' end of the E1 region beginning 8 bp upstream of the ATG 
codon. 

PCR then was performed to amplify Adenovirus 5 DNA from bp 3,323 
through 4,090 contiguous with an EcoRI restriction site. The primers which 
30 were employed were as follows: 
5' end, Ad5 bp 3323-3360: 
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5'-CATGAAGATCTGGAAGGTGCTGAGGTACGATGAGACC-3' (SEQ ID No. 38); 
and 

3' end, Ad5 bp 4090-4060: 

5'-GCGACTTAAGCAGTCAGCTG-AGACAGCAAGACACTTGCTTGATCCAAATCC 
5 -3' (SEQ ID No. 39). 

This amplified DNA fragment (sometimes hereinafter referred to as 
Fragment B) was digested with Bglll, thereby cutting at the Adenovirus 5 Bglll 
site (bp 3,382) and EcoRI, and ligated to the Bglll and EcoRI sites of pSE280-AE 
to reconstruct the complete E1a and E1b region from Adenovirus 5 bp 552 
10 through 4,090. The resulting plasmid is designated pSE280-E1. 

A construct containing the intact E1a/b region under the control of the 
synthetic promoter GRE5 was prepared as follows. The intact E1a/b region was 
excised from pSE280-E1, which was modified previously to contain a BamHI site 
3' to the E1 gene, by digesting with Xhol and BamHI. The Xhol to BamHI 
15 fragment containing the E1a/b fragment was cloned into the unique Xhol and 
BamHI sites of pGRE5-2/EBV (U.S. Biochemicals, Cleveland, Ohio) to form 
pGRE5-E1). 

Bacterial transformants containing the final construct were identified. 
Plasmid DNA was prepared and purified by banding in CsTFA prior to use for 

20 transfection of cells. 

Construction of plasmid including Adenovirus 5 E2A sequence. 

The Adenovirus 5 genome was digested with BamHI and Spel, which cut 
at bp 21,562 and 27,080, respectively. Fragments were separated on an 
agarose gel and the 5,518 bp BamHI to Spel fragment was isolated. The 5,518 

25 bp BamHI to Spel fragment was digested further with Smal, which cuts at bp 
23,912. The resulting 2,350 bp BamHI to Smal fragment was purified from an 
agarose gel, and ligated into the superlinker shuttle plasmid pSE280, and 
digested with BamHI and Smal to form pSE280-E2 BamHI-Smal. 

PCR then was performed to amplify Adenovirus 5 DNA from the Smal site 

30 at bp 23,912 through 24,730 contiguous with Nhel and EcoRI restriction sites. 
The primers which were employed were as follows: 
5' end, Ad5 bp 24,732-24,708: 
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5'-CACGAATTCGTCAGCGCTTCTCGTCGCGTCCAAGACCC-3' (SEQ ID No. 40); 
3' end, Ad5 bp 23,912-23,934: 

5'-CACCCCGGGGAGGCGGCGGCGACGGGGACGGG-3' (SEQ ID No. 41) 

This amplified DNA fragment was digested with Smal and EcoRI, and 
5 ligated to the Smal and EcoRI sites of pSE280-E2 Bam-Sma to reconstruct the 
complete E2a region from Ad5 bp 24,730 through 21,562. The resulting 
construct is pSE280-E2a. 

In order to convert the BamHI site at the 3' end of E2a to a Sail site, the 
E2a region was excised from pSE280-E2a by cutting with BamHI and Nhel, and 

10 recloned into the unique BamHI and Nhel sites of pSE280. Subsequently, the 
E2a region was excised from this construction with Nhel and Sail in order to 
clone into the Nhel and Sail sites of the pMAMneo (Clonetech, Palo Alto, CA) 
multiple cloning site in a 5' to 3' orientation, respectively. The resulting 
construct is pMAMneo E2a. 

1 5 Bacterial transf ormants containing the final pMAMneo-E2a were 

identified. Plasmid DNA was prepared and purified by banding in CsTFA. 
Circular plasmid DNA was linearized at the Xmnl site within the ampicillin 
resistance gene of pMAMneo-E2a, and further purified by the phenol/chloroform 
extraction and ethanol precipitation prior to use for transfection of cells. 

20 Transfection and selection of cells. 

In general, this process involved the sequential introduction, by calcium 
phosphate precipitation, or other means of DNA delivery, of two plasmid 
constructions each with a different viral gene, into a single tissue culture cell. 
The cells were transfected with a first construct and selected for expression of 

25 the associated drug resistance gene to establish stable integrants. Individual cell 
clones were established and assayed for function of the introduced viral gene. 
Appropriate candidate clones then were transfected with a second construct 
including a second viral gene and a second selectable marker. Transfected cells 
then were selected to establish stable integrants of the second construct, and 

30 cell clones were established. Cell clones were assayed for functional expression 
of both viral genes. 
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A549 (ATCC Accession No. CCL-185) were used for transfection. 
Appropriate selection conditions were established for G418 and hygromycin B by 
standard kill curve determination. 

Transfection of A549 cells with plasmids including E1 and E2a regions. 
5 pMAMNeo-E2a was linearized with Xmnl with the Amp R gene, introduced 

into cells by transfection, and cells were selected for stable integration of this 
plasmid by G418 selection until drug resistant colonies arose. The clones were 
isolated and screened for E2a expression by staining for E2a protein with a 
polyclonal antiserum, and visualizing by immunofluorescence. E2a function was 

10 screened by complementation of the temperature-sensitive mutant Ad5ts125 

virus which contains a temperature-sensitive mutation in the E2a gene. (Van Der 
Vliet, et al., J. Virology, Vol. 15, pgs. 348-354 (1975)). Positive clones 
expressing the E2a gene were identified and used for transfection with the 7 kb 
EcoRV to Xmnl fragment from pGRE5-E1 , which contains the GRE5 promoted 

15 E1a/b region plus the hygromycin R gene. Cells were selected for hygromycin 
resistance and assayed for E1a/b expression by staining with a monoclonal 
antibody for the E1 protein (Oncogene Sciences, Uniondale, N.Y.). E1 function 
was assayed by ability to complement an E1 -deleted vector. At this point, 
expression and function of E2a was verified as described above, thus 

20 establishing the expression of E1a/b and E2a in the positive cell clones. 

A transfected A549 ( A549 (ATCC Accession No. CCL-185);) cell 

lines showed good E1a/b and E2a expression and was selected for further 
characterization. It was designated the S8 cell line. 

G. Preparation of Adenoviral Vectors Containing Ad5./?gal.AF Genome in S8 
25 Improved Fiber-Complementing Cell Lines 

To prepare adenoviral vectors containing Ad5./?gal.AF (Ad5./ffgal.AF has 

been was deposited the ATCC under accession number VR2636) in S8 cells 

containing alternative forms of TPL for enhancing the expression of fiber 

proteins, the protocol as described in Example 2 for preparing Ad5./?gal.AF in 

30 21 1B cells was followed with the exception of pretreatment with 0.3 //M 

dexamethasone for 24 hours as described above. Thus, viral particles with the 

wildtype Ad5 fiber protein on their surface and containing the fiberless 
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Ad5.jffgal.AF genome were produced in 633 cells. Particles produced in 644 
cells also contained the fiberless Ad5./?gal.AF genome, but had the chimeric 
5T3H fiber protein, with the Ad3 fiber knob, on their surface. 

Thus, these viral preparations, prepared as described herein are useful for 
5 targeting delivery of the Ad5.iffgal.AF, Ad5.GFP.AF, or other similarly 

constructed fiberless genome with either wild-type or modified fibers. Preferably 
for purposes herein the fibers are from an Ad serotype D virus, more preferably 
from Ad37. 

EXAMPLE 6 

10 Pseudotyping and Infectivity of Recombinant Adenoviral Vectors 

Produced with Improved Fiber-Complementing Cell Lines 

A. Pseudotyping of Ad5.yffgal.AF 

To verify that adenoviral vectors were produced had altered tropisms, 
viral particles were purified from either 633 or 644 cells and were then Western 
15 blotted and probed with a polyclonal rabbit antibody against the Ad2 fiber 
(which detects the Ad5 and chimeric 5T3H fiber proteins). 

B. Infectivity of Cells with 633 or 644 Generated Virus Particles The 
cell lines, 633 or 644, prepared as described above, were infected with the 
indicated number of particles/cell of Ad5./?gal.AF and virus particles produced. 

20 Virus was then used to infect selected cell lines, including 21 1B, MRC-5 human 
fibroblasts, A-10 rat aortic endothelial cells, and THP-1 human monocytic cells. 
Unbound virus was removed by washing the cells and the cells were further 
incubated at 37 °C for 48 hours. Cells were then fixed with glutaraldehyde and 
stained with X-gal. The percentage of stained cells was then determined by light 

25 microscopy where all experiments were done in triplicate. 

The results indicated that adenoviral vectors could be retargeted by 
pseudotyping using packaging cell lines expressing different fiber proteins. 
Particles containing either fiber were equally infectious on 21 1B cells, while 
MRC-5 fibroblasts and THP-1 cells were more readily infected by virus containing 

30 the chimeric fiber. The A-10 rat endothelial cells were more readily infected by 
particles containing the wildtype Ad5 fiber protein. 
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EXAMPLE 7 

Transient Transcomplementation 

The ability of adenovirus type 5 (Ad5) to deliver therapeutic genes to cells 
is mediated by the interaction of the adenoviral fiber protein with the 
5 coxsackievirus-adenoviral receptor (CAR). Because a wide-range of cells express 
CAR, it was thought that it would be difficult to use adenoviruses to deliver 
genes to specific cell types. A system for testing modified fiber genes to 
identify tropisms of interest is described in copending U.S. application Serial 
No. 09/482,682 (also filed as International PCT application No. 

10 PCT/USOO/00265 on January 14, 2000). An in vitro system has been 

developed that involves infection of tissue culture cells with a fiber-deleted Ad 
and transient co-transfection with a plasmid directing fiber expression. This 
system allows one to produce and evaluate modified fibers expressed on a viral 
particle. This system can be used to produce therapeutic quantities of 

1 5 adenoviral vectors with modified fiber proteins, with such fibers having a new 
tropism added by insertion of a desired ligand into the fiber gene. These fibers 
may also have the natural tropism (i.e. binding to CAR) ablated. 

Plasmids used were pDV60 and pDV55 were prepared as described 
herein and in U.S. application Serial No. 09/482,682 (also filed as International 

20 PCT application No. PCT/US00/00265 on January 14, 2000). pDV60 is an 
pcDNA3.1 -based expression plasmid that contains the CMV promoter, Ad5 
tripartite leader, an intron, and the Ad5 fiber gene sequence. pDV55 contains 
no fiber gene and serves as the negative control. Ad5.jffgal.AF and 21 1B are 
described above. 293T cells are identical to 293 cells except they express an 

25 integrated SV40 large T antigen gene. HDF cells are human diploid fibroblasts. 
293T cells express CAR and or v integrins; HDF cells express or v integrins but no 
CAR. Transfections with fiber expression plasmids were performed with 
Lipof ectamine (GIBCO-BRL) using 20mg DNA and 50ml Lipof ectamine per 1 5cm 
dish. Cells were maintained in DMEM supplemented with 10% fetal bovine 

30 serum. 

The fiber deletion mutation of Ad5./?gal.AF is complemented in trans by 
passaging virions through 21 1B, a cell line that stably expresses functional Ad5 
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fiber. The present system was designed to complement Ad5./?gaI.AF by 
modified fibers expressed from transfected episomal plasmids in 293T cells. The 
result is a simplified and rapid method to incorporate modified fibers on a viral 
particle containing the Ad5.yffgal.AF genome that does not require propagation of 
5 the virus. 

The feasibility of transcomplementation of Ad5./ffgal.AF with episomal 
fiber-expressing plasmids was demonstrated in the following experiment. 293T 
cells were transfected with one of two plasmids: pDV55, which expresses no 
fiber or pDV60, which expresses wildtype Ad5 fiber. Fiber expression persists 

10 for at least six days. Twenty-four hours after transfection, these cells were 
infected at 2000 particles/cell with Ad5.jffgal.AF passaged through 21 1B cells. 
Seventy-two hours later, a crude viral lysate (CVL) was generated by exposing 
the cells to five freeze-thaw cycles. Viral particles were purified by cesium 
chloride gradient centrifugation. The resulting virions incorporated the fiber 

15 expressed from the episomal plasmid, as confirmed by Western blots performed 
with an antibody specific to the Ad5 fiber. 

Episomal plasmid transcomplementation system is suitable for quickly 
expressing and evaluating the properties of modified fibers in the context of a 
viral particle. Episomal plasmid transcomplementation will also be of great utility 

20 for quickly evaluating a bank of modified fibers for other binding properties, 

including new tropisms and the ablation of the native tropism. In addition to the 
rapid generation and testing of large numbers of modified fibers, there are other 
advantages to the Ad5.jffgal.AF transcomplementation system in terms of 
production and safety. Episomal plasmid transcomplementation has the inherent 

25 advantage over transcomplementation in that it is not necessary to make a 

stable cell line for every modified fiber for complementation with Ad5.jffgal.AF. 
Because the Ad5.jffgal.AF is deleted in E1, E3 and fiber, there is an additional 
gene deletion, which should render it very suitable for gene therapy. In addition, 
the presence of the fiber gene deletion decreases the opportunity to generate 

30 replication-competent virus via recombination in the packaging cells. A single 
Ad vector preparation can be retargeted to any number of different cell types 
simply by transfecting the cells with the appropriate fiber-expression construct. 



WO 01/83729 



PCT/EP01/04863 



-83- 
EX AMPLE 8 

Preparation of Adenoviral Gene Delivery Vectors Containing the Ad37 fiber 
protein 

This example describes construction of packaging cell lines expressing 
5 the Ad37 fiber protein, and their use in generating particles of a fiber-deleted Ad 
vector {such as Ad5.jffgal.AF) containing this fiber protein. The fiber protein is 
attached to the viral capsid by binding to the penton base protein through its N- 
terminus, and the Ad37 fiber was modified in order to make its N-terminal 
sequence more closely match that of the Ad5 protein to ensure that it would 
10 efficiently bind the Ad5 penton base in these vectors. 

A. Materials and methods 

Cell lines and wild-type adenovirus. Human A549 lung carcinoma 
epithelial cells and human Chang C conjunctival cells (American Type Culture 
Collection) were maintained in complete Dulbecco's Modified Eagle Medium 

15 (DMEM) with 10% fetal bovine serum. Wild-type Ad19p and Ad37 (ATCC) were 
propagated in A549 cells and purified by banding on CsCI 2 density gradients as 
previously described (Huang eta/. (1999) J. Virol. 75:2798-2802). Viral protein 
concentration was determined by the Bio-Rad Protein Assay, and was used to 
calculate the number of viral particles based on the known molecular weight of 

20 Ad2 virions (1 /vg = 4 x 10 9 particles). 

B. Construction of the Ad37 fiber expressing cell lines and the recombinant 
Ad37 knob protein. 

1 . Construction of an Expression Plasmid for the Ad37 Fiber Protein 
(pDV80) 

25 The plasmid designated pDV80 (see, SEQ ID No. 42) prepared for 

expression of the Ad37 fiber protein in mammalian cells, uses the same 
regulatory elements as the elements in pDV60, pDV67, and pDV69 to express 
the Ad37 fiber in packaging lines. It was constructed in two steps. 

First, the Ad37 fiber open reading frame was amplified from Ad37 

30 genomic DNA using synthetic oligonucleotide primers, L37: 5' TGT CCT GGA 
TCC AAG ATG AAG CGC GCC CGC C CC AGC GAA GAT GAC TTC 3'(SEQ ID 
NO. 43) and 37FR: 5' AAA CAC GGC GGC CGC TCT TTC ATT CTT G 3' SEQ 
ID NO. 44). L37 contains nucleotides (underlined) that differ from the Ad37 
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genomic sequence in order to add a unique £?a/7?H1 site (bold) before the start 
codon (italicized) and to create point mutations that make the N-terminal 
sequence of the fiber more closely match the N-terminal sequence of the Ad5 
fiber protein as follows: 
5 Ad37 MSKRLRVEDDFNPVYPY (SEQ ID No. 45) 

\Jf ^Jf ^ ^ %^ 

KRARPS (SEQ ID No . 46) 

Ad5 MKRARPSEDTFNPVYPY (SEQ ID No . 47) . 

37FR also incorporates a unique A/of1 site (bold). The PCR product was inserted 
10 into the BamH'\ and A/of1 sites of pCDNA3.1 zeo( + ) (Invitrogen) to create 
pDV78. The correct sequence of the Ad37 fiber protein, including inserted 
changes, was confirmed by sequencing. 

Two point mutations in the fiber gene in the 705 line, S356 to P356 and 
I362 to T362, were discovered by the sequencing. The mutations are not in the 
15 receptor binding domain in Ad37 fiber gene in the 705 cell line. They are buried 
in the knob trimer interface. To confirm that the these mutations do not affect 
receptor binding, the Ad37 fiber protein with the correct sequence was recloned, 
and 293T cells transfected with the virus and subsequently infected with 
Ad5.GFPAF to produce Ad37 pseudotyped virus. The results were the same as 
20 the results of the experiments with Ad37 pseudotyped virus produced from line 
705 (see, Wu et al. (2001) Virology 275:78-89). 

Second, a 1.2 kb Bam H 1/Bg/W fragment containing an adenovirus type 5 
tripartite leader was excised from pDV55 (see EXAMPLE 3) and inserted into the 
Bam H1 site of pDV78 to create pDV80 (SEQ ID No 42). Plasmid pDV80 has 
25 been deposited in the ATCC under accession number PTA-1 147. 

2. Construction of the recombinant Ad37 knob protein 
Recombinant Ad37 knob protein containing an N-terminal T7»Tag was 
produced in E. co/i using the PET expression system (Novagen). Ad37 fiber DNA 
(GenBank accession number U69132) was PCR amplified from wild-type Ad37 
30 genomic DNA using the following primers (SEQ ID Nos. 48 and 49): 
5' GGATC0 4 7GGG ATACTTGGTAGCA 3' (BamH\ site underlined and 
5' GCAA CTCGAG TCATTCTTGGGCAATATAGG 3'(Xhol site underlined). 
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The PCR reactions were performed at 94 °C (denaturation), 55 °C 
(annealing), 72 °C (extension, 30 cycles) using Tag DNA polymerase (Qiagen). 
The amplified DNA fragments, which contained residues 172 to 365 of the Ad37 
fiber protein with the addition of an N-terminal start codon (italicized), were 
5 purified and subcloned into the pCR-TOPO vector using the TA-Cloning Kit 
(Invitrogen). No replication errors were found by DNA sequencing. Plasmids 
from cultured transformed colonies were purified and digested with BarnH\ and 
Xho\. The fragment was inserted into the BamYW and Xho\ sites of the bacterial 
expression vector, pET21a (Novagen), and transformed into (DE3)pLYS S 
10 expression cells (Invitrogen). Colonies were selected for knob expression by 
induction with 1 mM IPTG for four hours at 37 °C and knob expression was 
determined by SDS-PAGE. The colony displaying highest knob expression was 
used for large-scale knob expression and induced with 0.5 mM IPTG at 30 °C for 
four hours. 

15 The recombinant T7*Tagged Ad37 knob protein was purified from 

sonicated bacteria using the T7»Tag Affinity Purification Kit as recommended by 
the manufacturer (Novagen). Recovered protein was analyzed for purity by SDS- 
PAGE followed by Coomassie staining or Western blotting with an HRP- 
conjugated or-T7»Tag monoclonal antibody as described by the manufacturer 

20 (Novagen) or an a-Ad37 fiber rabbit antibody. 

3. Preparation of Cell Lines that Express the Ad37 fiber protein 
Plasmid pDV80 DNA was purified using the Qiagen method and 
electroporated into the adenovirus-complementing cell line E1-2a S8 (see 
Examples herein; see also, Gorziglia et al. (1 996) J. Virology 70:4173-4178; 

25 and Von Seggern et al. (1998) J. Gen. Virol. 75:1461-1468). Stable clones were 
selected with 600//g/ml zeocin (Invitrogen). 

Clones were expanded and were screened for fiber expression by indirect 
immunofluorescence (Von Seggern et al. (1998) J. Gen. Virol. 75:1461-1468) 
using a rabbit polyclonal antibody directed against the Ad37 fiber (a-Ad37 fiber 

30 rabbit antibody) raised by immunizing rabbits with recombinant Ad37 fiber 
protein. Two clones (lines 705 and 731) that expressed the protein at a 
uniformly high level were selected. 
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EXAMPLE 9 

Production of Pseudotyped Ad Vector Particles 

To generate vector particles equipped ('pseudotyped') with the Ad37 
fiber protein, the Ad37 fiber-expressing 705 cells were infected (approximately 
5 1000 particles/cell) with Ad5.£gal.AF or with Ad5.GFP.AF. 
Materials and methods 

Ad5.j5gal.AF 

The construction of Ad5./ffgal.AF is described in Example 2 (it has been 
deposited on January 15, 1999, with the ATCC as listed above under accession 
10 number VR2636; see also, Von Seggern et al. (1999) J.Virol. 73:1601-1608; 
copending U.S. application Serial No. 09/482,682 filed January 14, 2000, and 
also International PCT application No. PCT/US00/00265, filed January 14, 
2000). 

Ad5.GFP.AF 

15 Ad5.GFP.AF was constructed by recombination in bacteria using a 

modification of the AdEasy System (see, U.S. Patent No. 5,922,576; see, also 
He et al. (1998) Proc. Natl. Acad. Sci. U.S.A. 35:2509-2514; the system is 
publicly available from the authors and other sources). 

First, a fiber-deleted genomic plasmid was constructed by removing the 

20 fiber gene from pAdEasy-1 (see, U.S. Patent No. 5,922,576; and He et al. 

(1998) Proc. Natl. Acad. Sci. U.S.A. 35:2509-2514; the AdEasy system and 
vectors are publicly available from He et al. at Johns Hopkins University). 
Plasmid pAdEasy-1 contains the entire Ad5 genome, except for nucleotides 1- 
3,533, which encompass the E1 genes, and nucleotides 28„1 30-30,820, which 

25 encompass the E3 gene. 

Plasmid pDV43 (see Example 2; see, also Von Seggern et al. (1999) 
J.Virol. 73:1601-1608) was digested with Pad , the ends blunted by treatment 
with the large fragment of E. coli DNA polymerase and dNTPs, and the product 
re-Iigated to produce plasmd pDV76. The resulting plasmid pDV76 is identical to 

30 pDV43 except for loss of the Pad site and contains the right end of the Ad5 
genome with E3 and fiber deletions. A 4.23 kb fragment from PDV76 was 
amplified using the oligonucleotide primers (SEQ ID Nos. 50 and 51: 
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5' CGC GCT GAC TCT TAA GGA CTA GTT TC 3', including the unique Spe1 
site in the Ad5 genome (bold); and 5' GCG CTT AAT TAA CAT CAT CAA TAA 
TAT ACC TTA TTT T 3', including a new Pac\ site (bold) adjacent to the right 
Ad5 ITR. Hence the resulting PCR amplified fragment contains nucleotides 
5 27,082 to 35,935 of the Ad5 genome with deletions of nucleotides 28,133 to 
32,743 (the E3 and fiber genes), and was used to replace the corresponding 
Spe1/Pac1 fragment of pAdEasy 1 (see, U.S. Patent No. 5,922,576) to create 
pDV77. 

Second, E. coil strain BJ5183 was electroporated with a mixture of 
10 pDV77 and Pme\ -linearized pAdTrack as described (U.S. Patent No. 5,922,576; 
He et al. (1998) Proc. NatL Acad, Sci. U.S.A. 35:2509-2514), and DNA was 
isolated from kanamycin-resistant colonies. The resulting plasmid, pDV83, 
contains a complete Ad5 genome with E1-, E3-, and fiber-deletions with a CMV- 
driven GFP reporter gene inserted at the site of the E1 deletion. The full length 
15 Ad chromosome was isolated by Pac1 digestion, and transfected into the El- 
and fiber-complementing 633 cells (Von Seggern eta/. (2000) J.Virol. 74:354- 
362). The 633 cells were produced by electroporating pDV67 (SEQ ID No. 30, 
deposited under ATCC accession number PTA-1 145) into the E1-2a S8 cells, 
described above. The recovered virus Ad5.GFP.AF was then plaque purified by 
20 plating on 633 cells and virus stocks were prepared by freeze-thawing cell 
pellets. 

Ad5-pseudotype particle production 
Particles with Ad5 fiber 

Ad5-pseudotyped particles were generated by virus growth in 633 cells, 
25 which express the wild type Ad5 fiber protein. Viral particles were isolated and 
purified over CsCI gradients (Von Seggern et al. (1999) J.Virol. 73:1601-1608; 
purified by centrifugation on preformed 1 5-40% CsCI gradients (111 ,000 x g for 
three hours at 4°Q). For analysis of viral proteins, ten jjg of the purified 
particles were electrophoresed on 8-16% gradient gels and the protein 
30 transferred to nylon membranes. The resulting blot was probed with rabbit 
polyclonal antibodies raised against recombinant Ad37 fiber or Ad5 fiber or 
penton base proteins expressed in baculovirus-infected cells. 
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Particles with Ad37 fiber 

Cells from the Ad37 fiber producing cell line 705 were infected at 
approximately 1000 particles/cell with Ad5.£gal.AF or with Ad5.GFP.AF. Viral 
particles were isolated and purified over CsCI gradients. The bands were 
5 harvested, dialyzed into storage buffer (10 mM Tris-pH 8.1, 0.9% NaCI, and 
10% glycerol), aliquoted and stored at -70°C. 
Viral protein analyses 

For analysis of viral proteins, 10 /jg of purified Ad5.yffgal.AF particles with 
no fiber (grown in 293 cells), the Ad5 fiber (grown in 633 cells), or the Ad37 
10 fiber (grown in 705 cells) were electrophoresed by 8-16% polyacrylamide 

gradient SDS-PAGE and the proteins were transferred to nylon membranes. The 
blot was then probed with a-Ad37 fiber rabbit antibody. Ad5 fiber and penton 
base were detected by reprobing the blot with polyclonal antibodies raised 
against recombinant proteins expressed in baculovirus-infected cells (Wickman et 
15 al. (1993) Cell 2:309-319). 

Adenovirus infection and cell binding assays 

Adherent Chang C and A549 cells were infected with GFP expressing 
Ad5 vectors containing the Ad5 fiber (Ad5.GFP.AF/5F) or the Ad37 fiber 
(Ad5.GFP.AF/37F) at 10,000 particles per cell for 3 hours at 37 °C, 5% C0 2 in 

20 DMEM, 10% FCS. Cells were washed twice with saline and then cultured 

overnight at 37° C, 5% CG 2 . The next day, the cells were detached with buffer 
containing 0.05% (w/v) trypsin and 0.5 mM EDTA (Boehringer Mannheim) for 5 
minutes at 37 °C. Suspended cells were washed once with PBS and then 
resuspended in phosphate-buffered saline (PBS), pH 7.4. GFP fluorescence was 

25 measured with a FACScan flow cytometer. A threshold established by the 
fluorescence of uninfected cells was used to distinguish cells expressing GFP. 
To assess the role of CAR in Ad infection, 1 0,000 attached cells were pre- 
incubated with 1 80 //g/ml RmcB, a function-blocking anti-CAR monoclonal 
antibody (Hsu et al. (1988) J. Virol. 52:1647-1652), in complete DMEM for 1 

30 hour at 4°C. A small volume containing Ad5.GFP.AF/5F or Ad5.GFP.AF/37F 
was then added at 10,000 particles per cell. The cells were infected for 3 
hours, cultured overnight, harvested, and analyzed for GFP expression. Percent 
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ceils expressing GFP was determined by the percent of cells detected above a 
threshold set by the fluorescence of uninfected Chang C cells. 

To measure adenovirus binding to cells, wild type Ad37 was labeled with 
125 l using lodogen (Pierce) according to manufacturer instructions and separated 
5 from free 125 l by gel filtration as described (Huang et al. (1999) J. Virol. 

73:2798-2802). Binding of radiolabeled wild type Ad37 on Chang C cells was 
then quantitated as described (Huang et al. (1999) a Virol. 73:2798-2802). 
Non-specific binding was determined by incubating cells and labeled Ad37 
particles in the presence of 100-fold concentration of unlabeled Ad37. Specific 

10 binding was calculated by subtracting the non-specific binding from the total 
cpm bound. To examine if divalent cations are required for binding, 10 mM 
ethylenediaminetetraacetic acid (EDTA) or various concentrations of CaCI 2 , or 
MgCI 2 were added to cells before incubation with labeled virus. To examine if 
the receptor for Ad37 is a protein, cells were pretreated with 10 //g/ml trypsin 

15 (GIBCO), subtilisin (Sigma), proteinase K (Boehringer-Mannheim), and bromelain 
(Sigman) at 37 °C for 1 hour, then washed twice with complete DMEM before 
adding labeled virus. Cells were >95% viable after protease treatment. 

Ad37 binding to conjunctival cells is calcium-dependent. Specific 125 l- 
labeled Ad37 binding to Chang C cells was measured in the presence of 10 mM 

20 EDTA and in the presence of varying concentrations of calcium chloride or 
magnesium chloride. Specific binding was determined by subtracting the 
nonspecific counts in the presence of 1 00-fold excess unlabeled virus from the 
total counts. 

Pretreatment of conjunctival cells with proteases inhibits Ad37 binding. 

25 Change C cells were pretreated with various proteases for 1 hour before binding 
125 l-labeled Ad37 to the cells. Nonspecific binding was measured by adding 
100-fold unlabeled Ad37 to cells with 125 l-labeled Ad37 and subtracting from 
total counts for specific binding. Percent inhibition represents the difference in 
specific binding of untreated cells and pretreated cells as a percentage of the 

30 specific binding of untreated cells. 
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Vlrus overlay protein blot assay (VOPBA) 

For VOPBA of human conjunctival membrane proteins probed with Ad37 
in the presence of EDTA or calcium chloride, Chang C membrane fractions were 
separated by 8% SDS-PAGE and transferred to a PDVF membrane. The 
5 membrane was subsequently probed with or without whole Ad37 particles, a 
polyclonal antibody against Ad37 fiber, and finally a horseradish peroxidase 
conjugated anti-rabbit antibody, in the presence of EDTA or calcium chloride. 
Transferred Chang C membrane proteins were probed with recombinant Ad37 
knob protein, instead of Ad37 knob, in the presence of calcium chloride. 

10 Confluent monolayers of Chang C and A549 cells were detached by 

scraping, pelleted by centrifugation, and then resuspended in 250 mM sucrose, 
20 mM HEPES, pH 7.0, 1 mM EDTA, and 2 //g/ml aprotinin and leupeptin. Cells 
were transferred into a dounce homogenizer and disrupted with 30 strokes. 
Organelles and nuclei were pelleted at 500g for 1 5 min. Plasma membrane 

1 5 fragments were then pelleted from the supernatant of cell lysates at 200,000g 
for 1 hour and then resuspended in 10 mM Tris*CI, pH 8.1, 10 //g/ml aprotinin 
and leupeptin. 

Cell membranes of Chang C or A549 cells were incubated (1:1) with a 
2% SDS, non-reducing buffer and separated on an 8% polyacrylamide gel 

20 without boiling. Membrane proteins were then electroblotted onto a PVDF 

membrane (Immobilon-P) and blocked in 5% (w/v) milk in PBS, pH 7.4, 0.02% 
Tween-20 (PBS-T). After blocking, the membrane was incubated with 1 //g/ml 
wild-type Ad19p or Ad37 in 0.5% (w/v) milk in PBS-T, 1 mM CaCI 2 , for 1 hour 
at room temperature. The membrane was then washed once with phosphate- 

25 buffered saline, pH 7.4 (PBS), 1 mM CaCI 2 , and incubated with 1 :500 dilution of 
<7-Ad37 fiber rabbit antibody in 0.5% (w/v) milk in PBS-T, 1mM CaCI 2 , for 30 
minutes at room temperature. The membrane was washed again with PBS, 1 
mM CaCI 2 , and incubated with 1 :5000 dilution of horseradish peroxidase (HRP) 
conjugated or-rabbit antibody (Sigma) in 0.5% (w/v) milk in PBS-T, 1 mM CaCJ 2 , 

30 for 30 minutes at room temperature. The membrane was washed four times in 
PBS, 1 mM CaCI 2 , once with PBS-T, 1 mM CaCI 2 , and once in 1 mM CaCI 2 . The 
blot was developed with enhanced chemiluminescence reagents (Pierce) for 5 
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minutes and placed onto a piece of Biomax film (Kodak) for 5 seconds to 1 

minute. For divalent metal cation experiments, membranes were incubated in 

the presence of 2 mM EDTA instead of 1 mM CaCI 2 in all solutions. To assay 

fiber knob binding to cell membrane proteins, membrane filters were incubated 

5 with 1 //g/ml purified T7-tagged Ad37 knob protein in Tris-buffered saline, 0.1% 

Tween-20, 1 mM CaC1 2/ for 1 hour at room temperature. a-Ad37 fiber rabbit 

antibody and HRP-conjugated anti-rabbit antibody were applied and the 

membrane was developed with substrate solution as described above. 

Results: Comparison of adenovirus infection of human conjunctival and liing 
10 epithelial cells with virus particles retargeted with Ad5 or Ad37 fiber proteins 

Packaging cell lines producing the Ad37 fiber protein were generated. 

Since the N-terminal amino acid sequences of the Ad5 and Ad37 fiber proteins 

differ significantly, and to ensure that the Ad37 fiber would be efficiently 

incorporated into Ad5 vector particles, several residues in the wild-type Ad37 

15 fiber were mutated to more closely match the Ad5 sequence. Stable cell lines 
producing this fiber under control of the CMV promoter and the adenovirus type 
5 tripartite leader were then generated and screened for fiber expression by 
indirect immunofluorescence. One clone (line 705), which expressed the Ad37 
fiber at a high level, was selected for further study. 

20 Cells from one cell line 633, which expresses the wild-type Ad5 fiber 

protein, and line 705 were infected with a fiber-deleted Ad5 vector carrying a 
jffgalactosidase reporter gene. The resulting vector particles contained the Ad5 
fiber protein (Ad5.£gal.AF/5F) and the Ad37 fiber protein (Ad5.j9gal.AF/37F), 
respectively. Incorporation of the correct fiber protein into viral particles was 

25 verified by Western blotting. Adenoviral vectors containing the GFP reporter 
gene, Ad5.GFP.AF/5F and Ad5.GFP.AF/37F, were created in the same fashion. 

Infection of a variety of cell types using the retargeted adenovirus 
particles was examined. As assayed by GFP fluorescence, Ad5.GFP.AF/5F 
exhibited good gene delivery to lung epithelial (A549) and conjunctival cells 

30 (Chang C). In contrast, Ad5.GFP.AF/37F efficiently delivered GFP to Chang C 
cells, but exhibited very poor gene delivery to A549 cells. Although CAR is 
expressed on the surface of A549 cells, as indicated by AD5.GFP.AF/5F 
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infection, Ad5.GFP.AF/37F was unable to infect these cells efficiently. This 
experiment shows that the Ad37 fiber protein can confer preferential infection of 
human conjunctival cells, but not CAR-expressing human lung epithelial cells. 

Hence CAR is not the primary receptor for Ad37. Recent studies reported 
5 that expression of CAR on the surface of Chinese hamster ovary (CHO) cells did 
not improve Ad37 binding (Amberg et al. (2000) J. Virol. 74:42-48), implying 
that Ad37 does not use CAR as a primary receptor. In order to verify this on 
human conjunctival cells, A549 and Chang C cells were pretreated with RmcB 
(Hsu et al. (1988) J. Virol. 52:1647-1652), a function-blocking monoclonal 

10 antibody against CAR. The RmcB antibody inhibited infection of A549 cells by 
Ad5.GFP.AF/5F, but it had little effect on infection of Chang C cells by 
Ad5.GFP.AF/37F. This indicates that CAR is not the primary receptor for Ad37 
on Chang C conjunctival cells. 

Ad37 binding to conjunctival cells requires divalent metal cations. It has 

15 been proposed (Roelvink et al. (1998) J.Virol. 72:7909-7915) that a combination 
of fiber binding to CAR and penton base binding to cr v -integrins allows some 
adenovirus serotypes to attach to cells. Although or v -integrin binding to the RGD 
motif of the adenovirus penton base is of relatively low affinity (Wickman et al. 
(1993) Cell 2:309-319), it may nonetheless contribute to viral attachment to the 

20 cell surface. Ad37 shows a particularly strong affinity for binding to integrin Ggff 5 
(Mathias et al. (1998) J.Virol. 72:8669-8675), suggesting that integrin ogff 5 
might be a primary receptor for Ad37. Binding of the RGD motif by a v -integrins 
requires the presence of divalent cations, such as calcium or magnesium (Stuiver 
et al. (1996) J. Cell Physiol. 755:521-531 ). In contrast, no divalent cations were 

25 required for binding in the CAR-Ad12 knob complex (Bewley et aL (1999) 
Science 255:1579-1583). 

To investigate the potential role of a v -integrins and divalent metal cations 
in Ad37 receptor binding, 125 l-labeled Ad37 binding to Chang C cells was 
examined in the absence or presence of EDTA. EDTA inhibited Ad37 binding to 

30 conjunctival cells but did not alter Ad5 binding. These findings suggest a 
requirement for divalent metals for Ad37 binding. 
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The presence of either calcium or magnesium ions helps a v /? 5 organize in focal 
contacts (Stuiver et aL (1996) J. Cell Physiol. 755:521-531), suggesting that 
calcium and magnesium aid in integrin a v J3 s function. To further test the 
potential role of integrin a y fi 5 in Ad37 cell attachment, 125 Mabeled Ad37 binding 
5 to Chang C cells was measured in the presence of varying concentrations of 
calcium or magnesium chloride. Magnesium ions had little effect on Ad37 
binding to Chang C cells. In contrast, calcium ions dramatically enhanced Ad37 
binding to Chang C cells. The optimal concentration of calcium chloride for 
Ad37 binding was 1 mM, while higher concentrations of calcium actually 

10 decreased virus binding to cells. The fact that calcium, but not magnesium, 
promoted Ad37 attachment is not consistent with integrin a v fi 5 as the primary 
receptor for viral attachment to the cells since either metal will support ligand 
binding to integrin a v /? 5 . Moreover, A549 cells express abundant a v -integrins 
(Mathias et al. (1998) J. Virol. 72:8669-8675) but were unable to support 

15 efficient binding of Ad37. 

Wild-type Ad37 particles bind to three conjunctival membrane proteins. 
Recent studies reported that protease treatment of CHO cells abolished Ad37 
binding (Arnberg et al. (2000) J. Virol. 74:4-2-48), implying that Ad37 bound to 
a protein receptor on CHO cells. Scatchard analysis of Ad37 binding to Chang C 

20 cells showed that each cell expresses approximately 24,000 fiber binding sites 
(Huang et aL (1999) J. Virol. 73:2798-2802). To determine if the Ad37 binding 
site on human conjunctival cells is also a protein, Chang C cells were treated 
with different proteases prior to measuring binding of 125 l-labeled Ad37. 
Digestion of surface proteins by all four proteases inhibited Ad37 binding to 

25 Chang C cells by greater than 50%. This finding showed that Ad37 also binds 
to a protein receptor on Chang C cells. 

Virus overlay protein blot assays (VOPBAs) were used to identify 
candidate viral protein receptors. This Western blot technique uses intact viral 
particles in place of antibodies to probe viral-receptors interactions. VOPBA was 

30 used herein to identify Chang C membrane proteins that bind to Ad37. In the 
absence of Ad37 particles, no protein bands were observed, while addition of 
virus in the absence of calcium revealed binding to a single 45 kDa protein. In 
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the presence of 1 mM calcium chloride, Ad37 reacted with three proteins with 
approximate molecular weights of 45, 50 and 60 kDa. The same three proteins 
were detected using a recombinant Ad37 fiber knob alone, indicating that Ad37- 
receptor interactions are fiber mediated and do not involve interactions of other 
5 capsid proteins such as the penton base. The size of the calcium-independent 
protein (45 kDa) is very similar to the known molecular weight of CAR. A direct 
comparison of the Ad37 VOPBA and a CAR Western blot showed that the 45 
kDa receptor co-migrates with CAR on SDS-PAGE. Moreover, two other 
members of subgroup D adenoviruses, Ad9 and AD15, have been shown to bind 

10 to CAR (Roelvink et aL (1998) J.Virol. 72:7909-7915). 

Since CAR does not appear to mediate Ad37 binding on intact Chang C 
cells, the possibility that the 50 or 60 kDa protein serves this function was 
tested by examining an adenovirus serotype that does not bind to Chang C cells. 
Ad19p, a closely related subgroup D adenovirus, binds poorly to Chang C cells 

15 (Huang et aL (1999) J. Virol. 75:2798-2802) and Ad19p recognition of the 
Ad37 receptor is therefore unlikely. Ad19p particles bound to the 45 and 60 
kDa receptors in the VOPBA, but did not bind to the 50 kDa receptor. 
Moreover, the 50 kDa receptor is expressed on Chang C cells, but not A549 
cells, which only support low levels of Ad37 binding and infection. Taken 

20 together, these data indicate that the 50 kDa protein is a primary candidate 
receptor for Ad37 on human conjunctival cells. 
Discussion 

The identification of the CAR protein as a major adenovirus receptor does 
not explain why certain subgroup D members, such as Ad37, preferentially 

25 infect ocular cells. A 50 kDa human conjunctival cell membrane protein is 

identified herein as a primary candidate for the receptor for Ad37. This 50 kDa 
protein is not present on A549 lung epithelial cells. Ad37 binding to this receptor 
is calcium-dependent, which is consistent with Ad37 binding and infection 
experiments. Ad37 also bound to a 60 kDa protein that is present on human 

30 conjunctival and lung epithelial cells. It does not, however, appear to be 

serotype specific. The molecular weights of MHC class I heavy chain, which has 
been proposed as a receptor for Ad5, and a v /? 3 and orj9 5 intergrins, receptors for 
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the penton base, are distinct from the 50 or to kDa receptor characterized in this 
study. 

The studies of Ad37-receptor interaction using VOPBAs are consistent 
with previous studies showing that subgroup D adenoviruses can bind to the 
5 extracellular domain of CAR (Roelvink et al. (1998) J.Virol. 72:7909-7915). 
Biochemical and structural studies on knob-CAR interactions indicate that the 
CAR binding site is located on the AB-loop of the fiber knob. Alignment of the 
fiber sequences of Ad37 and other adenoviruses reveals that the AB-loop of 
Ad37 is similar to those of Ad 12 and Ad5. Moreover, a phylogenetic tree of 

10 adenovirus knobs (Roelvink et al. (1998) J.ViroL 72:7909-7915) shows that 

fiber proteins of subgroup D are similar to those of subgroup C and E, which use 
CAR as their primary receptor. Ad37 does not, however, appear to effectively 
use CAR as a primary receptor, as demonstrated by virus binding and infection 
studies on Chang C conjunctival cells and A549 lung epithelial cells; 

15 It has been reported that Ad37 uses sialic acid as a receptor on Chinese 

hamster ovary (CHO) cells and human lung carcinoma (A549) cells (Arnberg et 
al. ((2000) J. Virol. 74:42-48). Human conjunctival cells were not studied. 
Human corneal epithelial (HCE) cells were the only ocular cell line studied and 
Ad37 binds relatively poorly to these cells, compared to binding on A549 cells 

20 (Arnberg et al. ((2000) J. Virol. 74:42-48). In addition, 8,4 X 10 7 wheat germ 
agglutinin molecules per cell were required to significantly inhibit Ad37 binding 
to sialic acid on sialic acid positive CHO cells (Arnberg et al. (2000) J. Virol. 
74:42-48), three orders of magnitude higher than the number of Ad37 receptors 
on Chang C conjunctival cells (Huang et al. (1 999) J. Virol. 73:2798-2802). 

25 Clearly, sialic acid is not the only factor responsible for Ad37 binding to the cell 
surface and its influence on Ad37 tropism is unclear. 

The results herein show that Ad37 selects a 50 kDa cellular receptor for 
binding to conjunctival cells, but it is possible that sialic acid also plays a role in 
this interaction. The characterization and identification of the Ad37 receptor 

30 have therapeutic implications and also explain the different tropism of Ad37. 
The 50 kDA receptor for Ad37 may also be the receptor for other subgroup D 
adenoviruses that cause severe cases of EKC, Ad19a and Ad8. Ad19p is a 
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nonpathogenic variant of Ad19 (Arnberg et al. (1998) Virology 227:239-244) 
while Ad19A, along with Ad8 and Ad37, are major causes of EKC. Ad 19a and 
Ad37 have identical fiber proteins (Arnberg et al. (1998) Virology 227:239-244) 
and have similar tropism in vivo. Ad8, Ad19a, and Ad37 agglutinate dog and 
5 guinea pig erythrocytes more effectively than four other serotypes that are 
associated with less severe forms of conjunctivitis (Arnberg et aL (1998) 
Virology 227:239-244), implying that the receptors of Ad18, Ad19A, and Ad37 
have similar characteristics. Hence, this 50 kDa receptor is an attractive drug 
target against EKC caused by adenoviruses to provide therapeutic intervention of 

10 ocular diseases associated with these viruses. 

EXAMPLE 10 
Targeting of the Ad5 vector to photoreceptor cells 

The fiber-deleted adenovirus vector Ad5.GFP.AF was propagated in 705 
cells, which express a modified Ad37 fiber protein. Viral particles 

15 (Ad5.GFP.Af/37F) were harvested, CsCI-purified and dialized into 0.9% NaCI, 10 
mM Tris, pH 8.1, and 10% glycerol. Two to three p\ of the resulting solution, 
containing approximately 1 x 1 0 9 particles///! was injected into the vitreous 
chamber of a mouse eye. Seven days post-injection, eyes were harvested, fixed 
with paraformaldehyde and cryo-sectioned. Sections were stained with an anti- 

20 rhodopsin antibody to identify photoreceptor cells and with DAPI to show all cell 
nuclei. The resulting sections showed red anti-rhodopsin staining in the 
photoreceptors, blue DAPI-stained nuclei, and green GFP staining in any 
transduced cells. The results revealed substantially exclusive transduction of 
photoreceptors. Co-localization of rhodopsin staining and GFP expression 

25 indicated selective transduction of photoreceptor cells. 

As a control, contralateral eyes were injected with a stock of the fiber- 
deleted vector AD5.yffgal.AF grown in the same Ad37 fiber-expressing cells. 
Since this virus (Ad5.jffgal.AF/37F) produces yffgal rather than GFP, the green 
staining is absent from the photoreceptors. 

30 Additional experiments using the AD37 fiber for targeting to the 

photoreceptor cells have been performed. Subretinal and intravitreal injection 
have been used in mouse models and the results demonstrate targeting to the 
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photoreceptors. As with intra vitreally injected eyes, the major cell type infectd 
via subretinal administration was the photoreceptor. 

As noted, Ad5.GFP.AF /37F infected Chang C cells efficiently, but A549 
cells poorly. Ad37 fiber protein confers preferential infection on human 
5 conjunctival cells, but not CAR-expressing human lung epithelial cells. Binding 
to conjunctival cells requires divalent cations. 



Since modifications will be apparent to those of skill in this art, it is 
intended that this invention be limited only by the scope of the appended claims; 
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WHAT IS CLAIMED IS: 

1 . An isolated nucleic acid molecule, comprising: 
adenovirus inverted terminal repeat sequences; an adenovirus packaging 
signal operatively linked thereto; and a photoreceptor-specific promoter. 
5 2. The isolated nucleic acid molecule of claim 1, further comprising a 

nucleic acid encoding a therapeutic product operatively linked to the promoter. 

3. The isolated nucleic acid molecule of claim 1, wherein the 
promoter is a rhodopsin promoter. 

4. The nucleic acid molecule of claim 1 , wherein the adenovirus 
10 genome does not encode a functional fiber protein such that packaging the 

nucleic acid requires complementation in a packaging cell. 

5. A recombinant adenovirus vector, comprising the nucleic acid 
molecule of any of claims 1-4 packaged therein. 

6. A recombinant adenovirus vector of claim 5, wherein inverted 
15 terminal repeat sequences (ITR) and a packaging signal are derived from 

adenovirus type 2 or adenovirus type 5. 

7. A recombinant adenovirus vector of claim 5, wherein the virus 
comprises a fiber protein. 

8. A recombinant adenovirus vector of claim 7, wherein the fiber 
20 protein selectively binds to photoreceptors in the eye of a mammal. 

9. A recombinant adenovirus vector of claim 7, wherein the fiber is a 
chimera composed of N-terminal sequences from adenovirus type 2 or type 5, 
and a sufficient portion of an adenovirus serotype D fiber for selective binding to 
photoreceptors in the eye of a mammal. 

25 10. A method for targeted delivery of a gene product to the eye of a 

mammal, comprising: 

administering a recombinant adenovirus virus that comprises heterologous 
DNA encoding the gene product or resulting in expression of the gene product, 
wherein the recombinant virus comprises a fiber protein that specifically or 

30 selectively binds to receptors that are expressed on cells in the eye. 
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1 1 . The method of claim 10, wherein the cells are photoreceptors. 

12. The method of claim 10, wherein administration is effected by 
intraocular delivery. 

13. The method of claim 10, wherein administration is effected by a 

5 method selected from subretinal injection, intravenous administration, periorbital 
administration, and intravitreal administration. 

14. The method of claim 10, wherein the recombinant virus comprises 
a fiber protein from an adenovirus type D serotype. 

15. The method of any of claims 10-14, wherein the fiber protein is an 
10 adenovirus type 37. 

16. The method of any of claims 10-14, wherein the fiber is a chimeric 
protein containing a sufficient portion of the N-terminus of an adenovirus type 2 
or type 5 fiber protein for interaction with an adenovirus type 2 or type 5 
penton, and a sufficient portion of an adenovirus serotype D knob portion of the 

15 fiber for selective binding to photoreceptors in the eye of a mammal. 

17. The method of any of claims 10-16, wherein the recombinant virus 
is an adenovirus type D serotype. 

18. The method of any of claims 10-17, wherein the encapsulated 
nucleic acid comprises a photoreceptor-specific promoter operatively linked to a 

20 nucleic acid comprising the therapeutic product. 

19. The method of claim 18, wherein the therapeutic product is 
selected from the group consisting of a trophic factor, an anti-apoptotic factor, a 
gene encoding a rhodopsin protein, a wild-type Stargardt disease gene (STDG1), 
an anti-cancer agent and a protein that regulates expression of a photoreceptor- 

25 specific gene product. 

20. The method of any of claims 10-19, wherein delivery is effected 
for treatment of an ocular disease. 

21. The method of claim 20, wherein the disorder is a retinal 
degenerative disease. 

30 22. The method of claim 20, wherein the disease is retinitis 

pigmentosa, Stargardt's disease, diabetic retinopathies, retinal vascularization, or 
retinoblastoma. 
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23. The method of any of claims 10-22, wherein the mammal is a 
human. 

24. The method of any of claims 10-22, wherein the viral nucleic acid 
comprises: 

5 an adenovirus inverted terminal repeat (ITR) sequences; and 

an adenovirus packaging signal operatively linked thereto. 

25. The method of claim 24, wherein the ITRs and packaging signal 
are derived from an adenovirus serotype B or C. 

26. The method of claim 24, wherein the ITRs and packaging signal 
10 are derived from an adenovirus type 2 or 5. 

27. The method of claim 24, wherein the viral nucleic acid further 
comprises a photoreceptor-specific promoter. 

28. A method of targeted gene therapy, comprising: 
administering a recombinant viral vector that comprises an adenovirus 

15 type 37 fiber protein or portion thereof, whereby the vector selectively 
transduces photoreceptors and delivers a gene product encoded by the 
recombinant viral vector; wherein the portion is sufficient for selective binding to 
photoreceptors. 

29. The method of claim 28, wherein the vector is administered into 
20 the eye. 

30. The method of claim 28, wherein the vector is administered to the 
vitreous cavity of the eye. 

31 . The method of claim 28, wherein administration is effected by 
subretinal injection, intravenous administration, periorbital administration or 

25 intravitreal administration. 

32. The method of any of claims 10-31, wherein at least about 10 7 
plaque forming units of virus are administered. 

33. The method of any of claims 10-31, wherein about 1 plaque 
forming unit to about 10 14 plaque forming units of virus are administered. 
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<110> VON SEGGERN, DANIEL 
NEMEROW, GLEN R. 
FRIEDLANDER, MARTIN 

<120> VECTORS FOR OCULAR TRANSDUCTION AND USE THEREFOR FOR GENETIC THERAPY 

<130> 756.1PCT/NOV0205P 

<140> 

<141> 2001-05-01 

<150> 09/562,934 
<151> 2000-05-01 

<160> 51 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 1 

cggtacacag aattcaggag acacaactcc 3 0 



<210> 2 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 2 

gcctggatcc gggaagttac gtaacgtggg aaaac 3 5 



<210> 3 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: linker 
<400> 3 

cgcggatccg eg 12 



<210> 4 
<211> 8710 
<212> DNA 

<213> Artificial Sequence 



<220> 
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<223> Description of Artificial Sequence: plasmid 



<400> 4 

cacctaaatt 

ctcatttttt 

cgagataggg 

ctccaacgtc 

accctaatca 

gagcccccga 

gaaagcgaaa 

caccacaccc 

gcgcaactgt 

agggggatgt 

ttgtaaaacg 

gccccccctc 

caagtgcata 

tatttgccac 

ttatgtttca 

agtatagccc 

accctagtat 

cggctggcct 

cacacggttt 

tcacttaagt 

tgcttaacgg 

atcaggatag 

gtcctgcagg 

ataaggcgcc 

taactgcagc 

aagctcatgg 

aagtggcgac 

ttcaccacct 

ctaaaccagc 

caatgacagt 

atgttggcac 

gttagaacca 

ggaagacctc 

ggatgatcct 

ctgtacggag 

acgccggacg 

gcgtctccgg 

caaagcatcc 

cctgataaca 

cgagtcacac 

aaagattatc 

cgtggtcaaa 

cttccaaaag 

gaatctcctc 

accttctcaa 

gctccagagc 

ttcctcacag 

taggtccctt 

cacttccccg 

agctatgcta 

gcaaggtgct 

catgctcatg 

ttctctcaaa 

ttaaacatta 

tacggccatg 

cagctcctcg 

catcggtcag 

gagacaacat 

aaacacctga 



gtaagcgtta 
aaccaatagg 
ttgagtgttg 
aaagggcgaa 
agttttttgg 
tttagagctt 
ggagcgggcg 
gccgcgctta 
tgggaagggc 
gctgcaaggc 
acggccagtg 
gaggtcgacg 
ctctatgtca 
atcctcttac 
acgtgtttat 
caccaccaca 
tcaacctgcc 
taaaaagcat 
cctgtcgagc 
tcatgtcgct 
gcggcgaagg 
ggcggtggtg 
aatacaacat 
ttgtcctccg 
acagcaccac 
cggggaccac 
ccctcataaa 
cccggtacca 
tggccaaaac 
ggagagccca 
aacacaggca 
tatcccaggg 
gcacgtaact 
ccagtatggt 
tgcgccgaga 
tagtcatatt 
tctcgccgct 
aggcgccccc 
tccaccaccg 
acgggaggag 
caaaacctca 
ctctacagcc 
gcaaacggcc 
tataaacatt 
tatatctcta 
gccctccacc 
acctgtataa 
cgcagggcca 
ccaggaacct 
accagcgtag 
gctcaaaaaa 
cagataaagg 
catgtctgcg 
gaagcctgtc 
ccggcgtgac 
gtcatgtccg 
tgctaaaaag 
tacagccccc 
aaaaccctcc 



atattttgtt 
ccgaaatcgg 
ttccagtttg 
aaaccgtcta 
ggtcgaggtg 
gacggggaaa 
ctagggcgct 
atgcgccgct 
gatcggtgcg 
gattaagttg 
aattgtaata 
gtatcgataa 
ttttcatggg 
actttttcat 
ttttcaattg 
tagcttatac 
acctccctcc 
catatcatgg 
caaacgctca 
gtccagctgc 
agaagtccac 
ctgcagcagc 
ggcagtggtc 
ggcacagcag 
aatattgttc 
agaacccacg 
cacgctggac 
tataaacctc 
ctgcccgccg 
ggactcgtaa 
cacgtgcata 
aacaacccat 
cacgttgtgc 
agcgcgggtt 
caaccgagat 
tcctgaagca 
tagatcgctc 
tggcttcggg 
cagaataagc 
cgggaagagc 
aaatgaagat 
aaagaacaga 
ctcacgtcca 
ccagcacctt 
agcaaatccc 
ttcagcctca 
gattcaaaag 
gctgaacata 
tgacaaaaga 
ccccgatgta 
tcaggcaaag 
caggtaagct 
ggtttctgca 
ttacaacagg 
cgtaaaaaaa 
gagtcataat 
cgaccgaaat 
ataggaggta 
tgcctaggca 



aaaattcgcg 
caaaatccct 
gaacaagagt 
tcagggcgat 
ccgtaaagca 
gccggcgaac 
ggcaagtgta 
acagggcgcg 
ggcctcttcg 
ggtaacgcca 
cgactcacta 
gcttgatatc 
actggtctgg 
acattgccca 
cagaaaattt 
agatcaccgt 
caacacacag 
gtaacagaca 
tcagtgatat 
tgagccacag 
gcctacatgg 
gcgcgaataa 
tcctcagcga 
cgcaccctga 
aaaatcccac 
tggccatcat 
ataaacatta 
tgattaaaca 
gctatacact 
ccatggatca 
cacttcctca 
tcctgaatca 
attgtcaaag 
tctgtctcaa 
cgtgttggtc 
aaaccaggtg 
tgtgtagtag 
ttctatgtaa 
cacacccagc 
tggaagaacc 
ctattaagtg 
taatggcatt 
agtggacgta 
caaccatgcc 
gaatattaag 
agcagcgaat 
cggaacatta 
atcgtgcagg 
acccacactg 
agctttgttg 
cctcgcgcaa 
ccggaaccac 
taaacacaaa 
aaaaacaacc 
ctggtcaccg 
gtaagactcg 
agcccggggg 
taacaaaatt 
aaatagcacc 



ttaaattttt 
tataaatcaa 
ccactattaa 
ggcccactac 
ctaaatcgga 
gtggcgagaa 
gcggtcacgc 
tcccattcgc 
ctattacgcc 
gggttttccc 
tagggcgaat 
gaattcagga 
ccacaactac 
agaataaaga 
caagtcattt 
accttaatca 
agtacacagt 
tattcttagg 
taataaactc 
gctgctgtcc 
gggtagagtc 
actgctgccg 
tgattcgcac 
tctcacttaa 
agtgcaaggc 
accacaagcg 
cctcttttgg 
tggcgccatc 
gcagggaacc 
tcatgctcgt 
ggattacaag 
gcgtaaatcc 
tgttacattc 
aaggaggtag 
gtagtgtcat 
cgggcgtgac 
ttgtagtata 
actccttcat 
caacctacac 
atgttttttt 
aacgcgctcc 
tgtaagatgt 
aaggctaaac 
caaataattc 
tccggccatt 
catgattgca 
acaaaaatac 
tctgcacgga 
attatgacac 
catgggcggc 
aaaagaaagc 
cacagaaaaa 
ataaaataac 
cttataagca 
tgattaaaaa 
gtaaacacat 
aatacatacc 
aataggagag 
ctcccgctcc 



gttaaatcag 
aagaatagac 
agaacgtgga 
gtgaaccatc 
accctaaagg 
aggaagggaa 
tgcgcgtaac 
cattcaggct 
agctggcgaa 
agtcacgacg 
tgggtaccgg 
gacacaactc 
attaatgaaa 
atcgtttgtg 
ttcattcagt 
aactcacaga 
cctttctccc 
tgttatattc 
cccgggcagc 
aacttgcggt 
ataatcgtgc 
ccgccgctcc 
cgcccgcagc 
atcagcacag 
gctgtatcca 
caggtagatt 
catgttgtaa 
caccaccatc 
gggactggaa 
catgatatca 
ctcctcccgc 
cacactgcag 
gggcagcagc 
acgatcccta 
gccaaatgga 
aaacagatct 
tccactctct 
gcgccgctgc 
attcgttctg 
ttttattcca 
cctccggtgg 
tgcacaatgg 
ccttcagggt 
tcatctcgcc 
gtaaaaatct 
aaaattcagg 
cgcgatcccg 
ccagcgcggc 
gcatactcgg 
gatataaaat 
acatcgtagt 
gacaccattt 
aaaaaaacat 
taagacggac 
gcaccaccga 
caggttgatt 
cgcaggcgta 
aaaaacacat 
agaacaacat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 
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acagcgcttc 
aaaaaacacc 
gcagagcgag 
ccagaaaacc 
caaatcgtca 
tctccgcaag 
ccggcttcca 
cagacaaggt 
gcggcataaa 
gcgagcgatc 
tgcaacgcgg 
cagcctcgcg 
ttcatccccg 
catgtcttta 
aacacgcaga 
cgtgtggcct 
ccgcagatcc 
agtttctgat 
aatctcgtgc 
gcgccgatgg 
cgattccgga 
gccgtgcaca 
agccggtcgc 
tcggcccatt 
cgattgctga 
ccgtcgcgca 
acctcgtgca 
cggtcattga 
tcttctggag 
atccggagct 
aactctatca 
gcgacgcaat 
gcgcggccgt 
ccagcactcg 
aggagacaat 
acgggtgttg 
gataccccac 
caccccccaa 
gccatagcca 
tcgtgggggt 
cagacccatg 
ccgggcgtct 
cgcccagtgc 
gccctgacgg 
gagctgcatg 
taatcagcca 
cctgaacctg 
taatggttac 
gcattctagt 
gttctagagc 
atttcgagct 
acaattccac 
gtgagctaac 
tcgtgccagc 
cgctcttccg 
gtatcagctc 
aagaacatgt 
gcgtttttcc 
aggtggcgaa 
gtgcgctctc 
ggaagcgtgg 
cgctccaagc 



acagcggcag 
actcgacacg 
tatatatagg 
gcacgcgaac 
cttccgtttt 
aattgattgg 
ttcaggtcga 
atagggcggc 
tcgccgtgac 
cttgaagctg 
gcatcccgat 
tcgcgaacgc 
tggcccgttg 
gttctatgat 
tgcagtcggg 
cgaacaccga 
cgggcaatga 
cgaaaagttc 
tttcagcttc 
tttctacaaa 
agtgcttgac 

gggtgtcacg 

ggaggccatg 
cggaccgcaa 
tccccatgtg 
ggctctcgat 
cgcggatttc 
ctggagcgag 
gccgtggttg 
tgcaggatcg 
gagcttggtt 
cgtccgatcc 
ctggaccgat 
tccgagggca 
accggaagga 
ggtcgtttgt 
cgagacccca 
gttcgggtga 
ctggccccgt 
tattattttg 
gtttttggat 
gtggctgcca 
cgtcgaccgg 
gcttgtctgc 
tgtcagaggt 
taccacattt 
aaacataaaa 
aaataaagca 
tgtggtttgt 
ggccgccacc 
tggcgtaatc 
acaacatacg 
tcacattaat 
tgcattaatg 
cttcctcgct 
actcaaaggc 
gagcaaaagg 
ataggctccg 
acccgacagg 
ctgttccgac 
cgctttctca 
tgggctgtgt 



cctaacagtc 
gcaccagctc 
actaaaaaat 
ctacgcccag 
cccacgttac 
ctccaattct 
ggtggcccgg 
gcctacaatc 
gatcagcggt 
tccctgatgg 
gccgccggaa 
cagcaagacg 
ctcgcgtttg 
gacacaaacc 
gcggcgcggt 
gcgaccctgc 
gatatgaaaa 
gacagcgtct 
gatgtaggag 
gatcgttatg 
attggggaat 
ttgcaagacc 
gatgcgatcg 
ggaatcggtc 
tatcactggc 
gagctgatgc 
ggctccaaca 
gcgatgttcg 
gcttgtatgg 
ccgcggctcc 
gacggcaatt 
ggagccggga 
ggctgtgtag 
aaggaatagg 
acccgcgcta 
tcataaacgc 
ttggggccaa 
aggcccaggg 
gggttaggga 
ggcgttgcgt 
ggcctgggca 
aacacccccg 
tcatggctgc 
tcccggcatc 
tttcaccgtc 
gtagaggttt 
tgaatgcaat 
atagcatcac 
ccaaactcat 
gcggtggagc 
atggtcatag 
agccggaagc 
tgcgttgcgc 
aatcggccaa 
cactgactcg 
ggtaatacgg 
ccagcaaaag 
cccccctgac 
actataaaga 
cctgccgctt 
tagctcacgc 
gcacgaaccc 



agccttacca 
aatcagtcac 
gacgtaacgg 
aaacgaaagc 
gtaacttccc 
tggagtggtg 
ctccatgcac 
catgccaacc 
ccagtgatcg 
tcgtcatcta 
gcgagaagaa 
tagcccagcg 
ctggcggtgt 
ccgcccagcg 
cccaggtcca 
agcgacccgc 
agcctgaact 
ccgacctgat 
ggcgtggata 
tttatcggca 
tcagcgagag 
tgcctgaaac 
ctgcggccga 
aatacactac 
aaactgtgat 
tttgggccga 
atgtcctgac 
gggattccca 
agcagcagac 
gggcgtatat 
tcgatgatgc 
ctgtcgggcg 
aagtactcgc 
ggagatgggg 
tgacggcaat 

ggggttcggt 

tacgcccgcg 
ctcgcagcca 
cggggtcccc 

ggggtctggt 

tggaccgcat 
acccccaaaa 
gccccgacac 
cgcttacaga 
atcaccgaaa 
tacttgcttt 
tgttgttgtt 
aaatttcaca 
caatgtatct 
tccagctttt 
ctgtttcctg 
ataaagtgta 
tcactgcccg 
cgcgcgggga 
ctgcgctcgg 
ttatccacag 
gccaggaacc 
gagcatcaca 
taccaggcgt 
accggatacc 
tgtaggtatc 
cccgttcagc 



gtaaaaaaga 
agtgtaaaaa 
ttaaagtcca 
caaaaaaccc 
ggatccgcgg 
aatccgttag 
cgcgacgcaa 
cgttccatgt 
aagttaggct 
cctgcctgga 
tcataatggg 
cgtcggccgc 
ccccggaaga 
tcttgtcatt 
cttcgcatat 
ttaacagcgt 
caccgcgacg 
gcagctctcg 
tgtcctgcgg 
ctttgcatcg 
cctgacctat 
cgaactgccc 
tcttagccag 
atggcgtgat 
ggacgacacc 
ggactgcccc 
ggacaatggc 
atacgaggtc 
gcgctacttc 
gctccgcatt 
agcttgggcg 
tacacaaatc 
cgatagtgga 
gaggctaact 
aaaaagacag 
cccagggctg 
tttcttcctt 
acgtcggggc 
catggggaat 
ccacgactgg 
gtactggcgc 
accaccgcgc 
ccgccaacac 
caagctgtga 
cgcgcgaggc 
aaaaaacctc 
aacttgttta 
aataaagcat 
tatcatgtct 
gttcccttta 
tgtgaaattg 
aagcctgggg 
ctttccagtc 
gaggcggttt 
tcgttcggct 
aatcagggga 
gtaaaaaggc 
aaaatcgacg 
ttccccctgg 
tgtccgcctt 
tcagttcggt 
ccgaccgctg 



aaacctatta 
agggccaagt 
caaaaaacac 
acaacttcct 
cattcacagt 
cgaggtgccg 
cgcggggagg 
gctcgccgag 
ggtaagagcc 
cagcatggcc 
gaaggccatc 
catgccctgc 
aatatatttg 
ggcgaattcg 
taaggtgacg 
caacagcgtg 
tctgtcgaga 
gagggcgaag 
gtaaatagct 
gccgcgctcc 
tgcatctccc 
gctgttctgc 
acgagcgggt 
ttcatatgcg 
gtcagtgcgt 
gaagtccggc 
cgcataacag 
gccaacatct 
gagcggaggc 
ggtcttgacc 
cagggtcgat 
gcccgcagaa 
aaccgacgcc 
gaaacacgga 
aataaaacgc 
gcactctgtc 
ttccccaccc 
ggcaggccct 
ggtttatggt 
actgagcaga 
gacacgaaca 
ggatttctgg 
ccgctgacgc 
ccgtctccgg 
agccggatca 
cccacctccc 
ttgcagctta 
ttttttcact 
ggatccacta 
gtgagggtta 
ttatccgctc 
tgcctaatga 
gggaaacctg 
gcgtattggg 
gcggcgagcg 
taacgcagga 
cgcgttgctg 
ctcaagtcag 
aagctccctc 
tctcccttcg 
gtaggtcgtt 
cgccttatcc 



3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
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ggtaactatc 
actggtaaca 
tggcctaact 
gttaccttcg 
ggtggttttt 
cctttgatct 
ttggtcatga 
tttaaatcaa 
agtgaggcac 
gtcgtgtaga 
ccgcgagacc 
gccgagcgca 
cgggaagcta 
acaggcatcg 
cgatcaaggc 
cctccgatcg 
ctgcataatt 
tcaaccaagt 
atacgggata 
tcttcggggc 
actcgtgcac 
aaaacaggaa 
ctcatactct 
ggatacatat 
cgaaaagtgc 



gtcttgagtc 
ggattagcag 
acggctacac 
gaaaaagagt 
ttgtttgcaa 
tttctacggg 
gattatcaaa 
tctaaagtat 
ctatctcagc 
taactacgat 
ca.cgctca.cc 
gaagtggtcc 
gagtaagtag 
tggtgtcacg 
gagttacatg 
ttgtcagaag 
ctcttactgt 
cattctgaga 
ataccgcgcc 
gaaaactctc 
ccaactgatc 
ggcaaaatgc 
tcctttttca 
ttgaatgtat 



caacccggta 
agcgaggtat 
tagaaggaca 
tggtagctct 
gcagcagatt 
gtctgacgct 
aaggatcttc 
atatgagtaa 
gatctgtcta 
acgggagggc 
ggctccagat 
tgcaacttta 
ttcgccagtt 
ctcgtcgttt 
atcccccatg 
taagttggcc 
catgccatcc 
atagtgtatg 
acatagcaga 
aaggatctta 
ttcagcatct 
cgcaaaaaag 
atattattga 
ttagaaaaat 



agacacgact 
gtaggcggtg 
gtatttggta 
tgatccggca 
acgcgcagaa 
cagtggaacg 
acctagatcc 
acttggtctg 
tttcgttcat 
ttaccatctg 
ttatcagcaa 
tccgcctcca 
aatagtttgc 
ggtatggctt 
ttgtgcaaaa 
gcagtgttat 
gtaagatgct 
cggcgaccga 
actttaaaag 
ccgctgttga 
tttactttca 
ggaataaggg 
agcatttatc 
aaacaaatag 



tatcgccact 
ctacagagtt 
tctgcgctct 
aacaaaccac 
aaaaaggatc 
aaaactcacg 
ttttaaatta 
acagttacca 
ccatagttgc 
gccccagtgc 
taaaccagcc 
tccagtctat 
gcaacgttgt 
cattcagctc 
aagcggttag 
cactcatggt 
tttctgtgac 
gttgctcttg 
tgctcatcat 
gatccagttc 
ccagcgtttc 
cgacacggaa 
agggttattg 
gggttccgcg 



ggcagcagcc 
cttgaagtgg 
gctgaagcca 
cgctggtagc 
tcaagaagat 
ttaagggatt 
aaaatgaagt 
atgcttaatc 
ctgactcccc 
tgcaatgata 
agccggaagg 
taattgttgc 
tgccattgct 
cggttcccaa 
ctccttcggt 
tatggcagca 
tggtgagtac 
cccggcgtca 
tggaaaacgt 
gatgtaaccc 
tgggtgagca 
atgttgaata 
tctcatgagc 
cacatttccc 



7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8710 



<210> 5 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 5 

atgggatcca agatgaagcg cgcaagaccg 30 



<210> 6 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<400> 6 

cataacgcgg ccgcttcttt attcttgggc 

<210> 7 
<211> 7148 
<212> DNA 

<213> Artificial Sequence 



30 



<220> 

<223> Description of Artificial Sequence: plasmid 
<400> 7 

gacggatcgg gagatctccc gatcccctat ggtcgactct cagtacaatc tgctctgatg 60 
ccgcatagtt aagccagtat ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg 120 
cgagcaaaat ttaagctaca acaaggcaag gcttgaccga caattgcatg aagaatctgc 180 



WO 01/83729 



5 



PCT/EP01/04863 



ttagggttag gcgttttgcg ctgcttcgcg 
gattattgac tagttattaa tagtaatcaa 
tggagttccg cgttacataa cttacggtaa 
cccgcccatt gacgtcaata atgacgtatg 
attgacgtca atgggtggac tatttacggt 
atcatatgcc aagtacgccc cctattgacg 
atgcccagta catgacctta tgggactttc 
tcgctattac catggtgatg cggttttggc 
actcacgggg atttccaagt ctccacccca 
aaaatcaacg ggactttcca aaatgtcgta 
gtaggcgtgt acggtgggag gtctatataa 
ctgcttactg gcttatcgaa attaatacga 
gagctcggat ccaagatgaa gcgcgcaaga 
ccatatgaca cggaaaccgg tcctccaact 
cccaatgggt ttcaagagag tccccctggg 
gttacctcca atggcatgct tgcgctcaaa 
ggcaacctta cctcccaaaa tgtaaccact 
aacataaacc tggaaatatc tgcacccctc 
gccgccgcac ctctaatggt cgcgggcaac 
accgtgcacg actccaaact tagcattgcc 
aagctagccc tgcaaacatc aggccccctc 
actgcctcac cccctctaac tactgccact 
atttatacac aaaatggaaa actaggacta 
gacctaaaca ctttgaccgt agcaactggt 
caaactaaag ttactggagc cttgggtttt 
gcaggaggac taaggattga ttctcaaaac 
tttgatgctc aaaaccaact aaatctaaga 
gcccacaact tggatattaa ctacaacaaa 
tccaaaaagc ttgaggttaa cctaagcact 
atagccatta atgcaggaga tgggcttgaa 
cccctcaaaa caaaaattgg ccatggccta 
aaactaggaa ctggccttag ttttgacagc 
aatgataagc taactttgtg gaccacacca 
gagaaagatg ctaaactcac tttggtctta 
gtttcagttt tggctgttaa aggcagtttg 
catcttatta taagatttga cgaaaatgga 
gaatattgga actttagaaa tggagatctt 
ggatttatgc ctaacctatc agcttatcca 
attgtcagtc aagtttactt aaacggagac 
ctaaacggta cacaggaaac aggagacaca 
tgggactggt ctggccacaa ctacattaat 
tcatacattg cccaagaata aagaagcggc 
ctatagtgtc acctaaatgc tagagctcgc 
cagccatctg ttgtttgccc ctcccccgtg 
actgtccttt cctaataaaa tgaggaaatt 
attctggggg gtggggtggg gcaggacagc 
catgctgggg atgcggtggg ctctatggct 
agggggtatc cccacgcgcc ctgtagcggc 
cgcagcgtga ccgctacact tgccagcgcc 
tcctttctcg ccacgttcgc cggctttccc 
gggttccgat ttagfcgcttt acggcacctc 
tcacgtagtg ggccatcgcc ctgatagacg 
ttctttaata gtggactctt gttccaaact 
tcttttgatt tataagggat tttggggatt 
taacaaaaat ttaacgcgaa ttaattctgt 
ccccaggctc cccaggcagg cagaagtatg 
aggtgtggaa agtccccagg ctccccagca 
tagtcagcaa ccatagtccc gcccctaact 
tccgcccatt ctccgcccca tggctgacta 
gcctctgcct ctgagctatt ccagaagtag 
tgcaaaaagc tcccgggagc ttgtatatcc 
ggatcgtttc gcatgattga acaagatgga 



atgtacgggc cagatatacg cgttgacatt 240 
ttacggggtc attagttcat agcccatata 300 
atggcccgcc tggctgaccg cccaacgacc 360 
ttcccatagt aacgccaata gggactttcc 42 0 
aaactgccca cttggcagta catcaagtgt 480 
tcaatgacgg taaatggccc gcctggcatt 540 
ctacttggca gtacatctac gtattagtca 600 
agtacatcaa tgggcgtgga tagcggtttg 660 
ttgacgtcaa tgggagtttg ttttggcacc 720 
acaactccgc cccattgacg caaatgggcg 780 
gcagagctct ctggctaact agagaaccca 840 
ctcactatag ggagacccaa gcttggtacc 900 
ccgtctgaag ataccttcaa ccccgtgtat 960 
gtgccttttc ttactcctcc ctttgtatcc 1020 
gtactctctt tgcgcctatc cgaacctcta 1080 
atgggcaacg gcctctctct ggacgaggcc 114 0 
gtgagcccac ctctcaaaaa aaccaagtca 1200 
acagttacct cagaagccct aactgtggct 1260 
acactcacca tgcaatcaca ggccccgcta 132 0 
acccaaggac ccctcacagt gtcagaagga 1380 
accaccaccg atagcagtac ccttactatc 1440 
ggtagcttgg gcattgactt gaaagagccc 15 00 
aagtacgggg ctcctttgca tgtaacagac 1560 
ccaggtgtga ctattaataa tacttccttg 162 0 
gattcacaag gcaatatgca acttaatgta 1680 
agacgcctta tacttgatgt tagttatccg 1740 
ctaggacagg gccctctttt tataaactca 1800 
ggcctttact tgtttacagc ttcaaacaat 1860 
gccaaggggt tgatgtttga cgctacagcc 192 0 
tttggttcac ctaatgcacc aaacacaaat 1980 
gaatttgatt caaacaaggc tatggttcct 2040 
acaggtgcca ttacagtagg aaacaaaaat 2100 
gctccatctc ctaactgtag actaaatgca 2160 
acaaaatgtg gcagtcaaat acttgctaca 2220 
gctccaatat ctggaacagt tcaaagtgct 2280 
gtgctactaa acaattcctt cctggaccca 2340 
actgaaggca cagcctatac aaacgctgtt 240 0 
aaatctcacg gtaaaactgc caaaagtaac 2460 
aaaactaaac ctgtaacact aaccattaca 2520 
actccaagtg catactctat gtcattttca 2580 
gaaatatttg ccacatcctc ttacactttt 2640 
cgctcgagca tgcatctaga gggccctatt 2700 
tgatcagcct cgactgtgcc ttctagttgc 2760 
ccttccttga ccctggaagg tgccactccc 282 0 
gcatcgcatt gtctgagtag gtgtcattct 28 80 
aagggggagg attgggaaga caatagcagg 2940 
tctgaggcgg aaagaaccag ctggggctct 3000 
gcattaagcg cggcgggtgt ggtggttacg 3060 
ctagcgcccg ctcctttcgc tttcttccct 3120 
cgtcaagctc taaatcgggg catcccttta 3180 
gaccccaaaa aacttgatta gggtgatggt 3240 
gtttttcgcc ctttgacgtt ggagtccacg 3300 
ggaacaacac tcaaccctat ctcggtctat 3360 
tcggcctatt ggttaaaaaa tgagctgatt 342 0 
ggaatgtgtg tcagttaggg tgtggaaagt 3480 
caaagcatgc atctcaatta gtcagcaacc 3540 
ggcagaagta tgcaaagcat gcatctcaat 3600 
ccgcccatcc cgcccctaac tccgcccagt 3660 
atttttttta tttatgcaga ggccgaggcc 3720 
tgaggaggct tttttggagg cctaggcttt 3780 
attttcggat ctgatcaaga gacaggatga 3840 
ttgcacgcag gttctccggc cgcttgggtg 3900 
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gagaggctat tcggctatga 
ttccggctgt cagcgcaggg 
ctgaatgaac tgcaggacga 
tgcgcagctg tgctcgacgt 
gtgccggggc aggatctcct 
gctgatgcaa tgcggcggct 
gcgaaacatc gcatcgagcg 
gatctggacg aagagcatca 
cgcatgcccg acggcgagga 
atggtggaaa atggccgctt 
cgctatcagg acatagcgtt 
gctgaccgct tcctcgtgct 
tatcgccttc ttgacgagtt 
cgacgcccaa cctgccatca 
gcttcggaat cgttttccgg 
tggagttctt cgcccacccc 
atagcatcac aaatttcaca 
ccaaactcat caatgtatct 
cgtaatcatg gtcatagctg 
acatacgagc cggaagcata 
cattaattgc gttgcgctca 
attaatgaat cggccaacgc 
cctcgctcac tgactcgctg 
caaaggcggt aatacggtta 
caaaaggcca gcaaaaggcc 
ggctccgccc ccctgacgag 
cgacaggact ataaagatac 
ttccgaccct gccgcttacc 
tttctcaatg ctcacgctgt 
gctgtgtgca cgaacccccc 
ttgagtccaa cccggtaaga 
ttagcagagc gaggtatgta 
gctacactag aaggacagta 
aaagagttgg tagctcttga 
tttgcaagca gcagattacg 
ctacggggtc tgacgctcag 
tatcaaaaag gatcttcacc 
aaagtafcata tgagtaaact 
tctcagcgat ctgtctattt 
ctacgatacg ggagggctta 
gctcaccggc tccagattta 
gtggtcctgc aactttatcc 
taagtagttc gccagttaat 
tgtcacgctc gtcgtttggt 
ttacatgatc ccccatgttg 
tcagaagtaa gttggccgca 
ttactgtcat gccatccgta 
tctgagaata gtgtatgcgg 
ccgcgccaca tagcagaact 
aactctcaag gatcttaccg 
actgatcttc agcatctttt 
aaaatgccgc aaaaaaggga 
tttttcaata ttattgaagc 
aatgtattta gaaaaataaa 
ctgacgtc 



ctgggcacaa cagacaatcg 
gcgcccggtt ctttttgtca 
ggcagcgcgg ctatcgtggc 
tgtcactgaa gcgggaaggg 
gtcatctcac cttgctcctg 
gcatacgctt gatccggcta 
agcacgtact cggatggaag 
ggggctcgcg ccagccgaac 
tctcgtcgtg acccatggcg 
ttctggattc atcgactgtg 
ggctacccgt gatattgctg 
ttacggtatc gccgctcccg 
cttctgagcg ggactctggg 
cgagatttcg attccaccgc 
gacgccggct ggatgatcct 
aacttgttta ttgcagctta 
aataaagcat ttttttcact 
tatcatgtct gtataccgtc 
tttcctgtgt gaaattgtta 
aagtgtaaag cctggggtgc 
ctgcccgctt tccagtcggg 
gcggggagag gcggtttgcg 
cgctcggtcg ttcggctgcg 
tccacagaat caggggataa 
aggaaccgta aaaaggccgc 
catcacaaaa atcgacgctc 
caggcgtttc cccctggaag 
ggatacctgt ccgcctttct 
aggtatctca gttcggtgta 
gttcagcccg accgctgcgc 
cacgacttat cgccactggc 
ggcggtgcta cagagttctt 
tttggtatct gcgctctgct 
tccggcaaac aaaccaccgc 
cgcagaaaaa aaggatctca 
tggaacgaaa actcacgtta 
tagatccttt taaattaaaa 
tggtctgaca gttaccaatg 
cgttcatcca tagttgcctg 
ccatctggcc ccagtgctgc 
tcagcaataa accagccagc 
gcctccatcc agtctattaa 
agtttgcgca acgttgttgc 
atggcttcat tcagctccgg 
tgcaaaaaag cggttagctc 
gtgttatcac tcatggttat 
agatgctttt ctgtgactgg 
cgaccgagtt gctcttgccc 
ttaaaagtgc tcatcattgg 
ctgttgagat ccagttcgat 
actttcacca gcgtttctgg 
ataagggcga cacggaaatg 
atttatcagg gttattgtct 
caaatagggg ttccgcgcac 



gctgctctga tgccgccgtg 3 960 
agaccgacct gtccggtgcc 4 02 0 
tggccacgac gggcgttcct 40 80 
actggctgct attgggcgaa 4140 
ccgagaaagt atccatcatg 42 00 
cctgcccatt cgaccaccaa 42 60 
ccggtcttgt cgatcaggat 4320 
tgttcgccag gctcaaggcg 4380 
atgcctgctt gccgaatatc 4440 
gccggctggg tgtggcggac 4500 
aagagcttgg cggcgaatgg 4560 
attcgcagcg catcgccttc 4620 
gttcgaaatg accgaccaag 4680 
cgccttctat gaaaggttgg 4740 
ccagcgcggg gatcfecatgc 4800 
taatggttac aaataaagca 4 860 
gcattctagt tgtggtttgt 4920 
gacctctagc tagagcttgg 4980 
tccgctcaca attccacaca 5040 
ctaatgagtg agctaactca 5100 
aaacctgtcg tgccagctgc 5160 
tattgggcgc tcttccgctt 5220 
gcgagcggta tcagctcact 52 80 
cgcaggaaag aacatgtgag 5340 
gttgctggcg tttttccata 5400 
aagtcagagg tggcgaaacc 5460 
ctccctcgtg cgctctcctg 5520 
cccttcggga agcgtggcgc 5580 
ggtcgttcgc tccaagctgg 564 0 
cttatccggt aactatcgtc 5700 
agcagccact ggtaacagga 57 60 
gaagtggtgg cctaactacg 5820 
gaagccagtt accttcggaa 5880 
tggtagcggt ggtttttttg 5940 
agaagatcct ttgatctttt 60 00 
agggattttg gtcatgagat 6060 
atgaagtttt aaatcaatct 6120 
cttaatcagt gaggcaccta 6180 
actccccgtc gtgtagataa 6240 
aatgataccg cgagacccac 63 00 
cggaagggcc gagcgcagaa 6360 
ttgttgccgg gaagctagag 6420 
cattgctaca ggcatcgtgg 6480 
ttcccaacga tcaaggcgag 6540 
cttcggtcct ccgatcgttg 6600 
ggcagcactg cataattctc 6660 
tgagtactca accaagtcat 6720 
ggcgtcaata cgggataata 6780 
aaaacgttct tcggggcgaa 6840 
gtaacccact cgtgcaccca 6900 
gtgagcaaaa acaggaaggc 6960 
ttgaatactc atactcttcc 7020 
catgagcgga tacatatttg 7080 
atttccccga aaagtgccac 7140 

7148 



<210> 8 
<211> 7469 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: plasmid 
<400> 8 

gacggatcgg gagatctccc gatcccctat ggtcgactct cagtacaatc tgctctgatg 60 
ccgcatagtt aagccagtat ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg 120 
cgagcaaaat ttaagctaca acaaggcaag gcttgaccga caattgcatg aagaatctgc 180 
ttagggttag gcgttttgcg ctgcttcgcg atgtacgggc cagatatacg . cgttgacatt 240 
gattattgac tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata 3 00 
tggagttccg cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc 360 
cccgcccatt gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc 42 0 
attgacgtca atgggtggac tatttacggt aaactgccca cttggcagta catcaagtgt 480 
atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 540 
atgcccagta catgacctta tgggactttc ctacttggca gtacatctac gtattagtca 600 
tcgctattac catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg 660 
actcacgggg atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc 720 
aaaatcaacg ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg 780 
gtaggcgtgt acggtgggag gtctatataa gcagagctct ctggctaact agagaaccca 840 
ctgcttactg gcttatcgaa attaatacga ctcactatag ggagacccaa gcttggtacc 900 
gagctcggat ctgaattcga gctcgctgtt gggctcgcgg ttgaggacaa actcttcgcg 960 
gtctttccag tactcttgga tcggaaaccc gtcggcctcc gaacggtact ccgccaccga 102 0 
gggacctgag cgagtccgca tcgaccggat cggaaaacct ctcgagaaag gcgtctaacc 1080 
agtcacagtc gcaaggtagg ctgagcaccg tggcgggcgg cagcgggtgg cggtcggggt 1140 
tgtttctggc ggaggtgctg ctgatgatgt aattaaagta ggcggtcttg agacggcgga 12 00 
tggtcgaggt gaggtgtggc aggcttgaga tccaagatga agcgcgcaag accgtctgaa 1260 
gataccttca accccgtgta tccatatgac acggaaaccg gtcctccaac tgtgcctttt 1320 
cttactcctc cctttgtatc ccccaatggg tttcaagaga gtccccctgg ggtactctct 13 80 
ttgcgcctat ccgaacctct agttacctcc aatggcatgc ttgcgctcaa aatgggcaac 1440 
ggcctctctc tggacgaggc cggcaacctt acctcccaaa atgtaaccac tgtgagccca 150 0 
cctctcaaaa aaaccaagtc aaacataaac ctggaaatat ctgcacccct cacagttacc 1560 
tcagaagccc taactgtggc tgccgccgca cctctaatgg tcgcgggcaa cacactcacc 162 0 
atgcaatcac aggccccgct aaccgtgcac gactccaaac ttagcattgc cacccaagga 1680 
cccctcacag tgtcagaagg aaagctagcc ctgcaaacat caggccccct caccaccacc 1740 
gatagcagta cccttactat cactgcctca ccccctctaa ctactgccac tggtagcttg 1800 
ggcattgact tgaaagagcc catttataca caaaatggaa aactaggact aaagtacggg 1860 
gctcctttgc atgtaacaga cgacctaaac actttgaccg tagcaactgg tccaggtgtg 1920 
actattaata atacttcctt gcaaactaaa gttactggag ccttgggttt tgattcacaa 1980 
ggcaatatgc aacttaatgt agcaggagga ctaaggattg attctcaaaa cagacgcctt 2040 
atacttgatg ttagttatcc gtttgatgct caaaaccaac taaatctaag actaggacag 2100 
ggccctcttt ttataaactc agcccacaac ttggatatta actacaacaa aggcctttac 2160 
ttgtttacag cttcaaacaa ttccaaaaag cttgaggtta acctaagcac tgccaagggg 2220 
ttgatgtttg acgctacagc catagccatt aatgcaggag atgggcttga atttggttca 2280 
cctaatgcac caaacacaaa tcccctcaaa acaaaaattg gccatggcct agaatttgat 2340 
tcaaacaagg ctatggttcc taaactagga actggcctta gttttgacag cacaggtgcc 24 00 
attacagtag gaaacaaaaa taatgataag ctaactttgt ggaccacacc agctccatct 2460 
cctaactgta gactaaatgc agagaaagat gctaaactca ctttggtctt aacaaaatgt .2520 
ggcagtcaaa tacttgctac agtttcagtt ttggctgtta aaggcagttt ggctccaata 25 80 
tctggaacag ttcaaagtgc tcatcttatt ataagatttg acgaaaatgg agtgctacta 2640 
aacaattcct tcctggaccc agaatattgg aactttagaa atggagatct tactgaaggc 2700 
acagcctata caaacgctgt tggatttatg cctaacctat cagcttatcc aaaatctcac 2760 
ggtaaaactg ccaaaagtaa cattgtcagt caagtttact taaacggaga caaaactaaa 2820 
cctgtaacac taaccattac actaaacggt acacaggaaa caggagacac aactccaagt 2880 
gcatactcta tgtcattttc atgggactgg tctggccaca actacattaa tgaaatattt 2940 
gccacatcct cttacacttt ttcatacatt gcccaagaat aaagaagcgg ccgctcgagc 3000 
atgcatctag agggccctat tctatagtgt cacctaaatg ctagagctcg ctgatcagcc 3060 
tcgactgtgc cttctagttg ccagccatct gttgtttgcc cctcccccgt gccttccttg 3120 
accctggaag gtgccactcc cactgtcctt tcctaataaa atgaggaaat tgcatcgcat 3180 
tgtctgagta ggtgtcattc tattctgggg ggtggggtgg ggcaggacag caagggggag 3240 
gattgggaag acaatagcag gcatgctggg gatgcggtgg gctctatggc ttctgaggcg 3300 
gaaagaacca gctggggctc tagggggtat ccccacgcgc cctgtagcgg cgcattaagc 3360 
gcggcgggtg tggtggttac gcgcagcgtg accgctacac ttgccagcgc cctagcgccc 3420 
gctcctttcg ctttcttccc ttcctttctc gccacgttcg ccggctttcc ccgtcaagct 3480 
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ctaaatcggg gcatcccttt agggttccga 
aaacttgatt agggtgatgg ttcacgtagt 
cctttgacgt tggagtccac gttctttaat 
ctcaacccta tctcggtcta ttcttttgat 
tggttaaaaa atgagctgat ttaacaaaaa 
gtcagttagg gtgtggaaag tccccaggct 
catctcaatt agtcagcaac caggtgtgga 
atgcaaagca tgcatctcaa ttagtcagca 
ccgcccctaa ctccgcccag ttccgcccat 
atttatgcag aggccgaggc cgcctctgcc 
ttttttggag gcctaggctt ttgcaaaaag 
tctgatcaag agacaggatg aggatcgttt 
ggttctccgg ccgcttgggt ggagaggcta 
ggctgctctg atgccgccgt gttccggctg 
aagaccgacc tgtccggtgc cctgaatgaa 
ctggccacga cgggcgttcc ttgcgcagct 
gactggctgc tattgggcga agtgccgggg 
gccgagaaag tatccatcat ggctgatgca 
acctgcccat tcgaccacca agcgaaacat 
gccggtcttg tcgatcagga tgatctggac 
ctgttcgcca ggctcaaggc gcgcatgccc 
gatgcctgct tgccgaatat catggtggaa 
ggccggctgg gtgtggcgga ccgctatcag 
gaagagcttg gcggcgaatg ggctgaccgc 
gattcgcagc gcatcgcctt ctatcgcctt 
ggttcgaaat gaccgaccaa gcgacgccca 
ccgccttcta' tgaaaggttg ggcttcggaa 
tccagcgcgg ggatctcatg ctggagttct 
ataatggtta caaataaagc aatagcatca 
tgcattctag ttgtggtttg tccaaactca 
cgacctctag ctagagcttg gcgtaatcat 
atccgctcac aattccacac aacatacgag 
cctaatgagt gagctaactc acattaattg 
gaaacctgtc gtgccagctg cattaatgaa 
gtattgggcg ctcttccgct tcctcgctca 
ggcgagcggt atcagctcac tcaaaggcgg 
acgcaggaaa gaacatgtga gcaaaaggcc 
cgttgctggc gtttttccat aggctccgcc 
caagtcagag gtggcgaaac ccgacaggac 
gctccctcgt gcgctctcct gttccgaccc 
tcccttcggg aagcgtggcg ctttctcaat 
aggtcgttcg ctccaagctg ggctgtgtgc 
ccttatccgg taactatcgt cttgagtcca 
cagcagccac tggtaacagg attagcagag 
tgaagtggtg gcctaactac ggctacacta 
tgaagccagt taccttcgga aaaagagttg 
ctggtagcgg tggttttttt gtttgcaagc 
aagaagatcc tttgatcttt tctacggggt 
aagggatttt ggtcatgaga ttatcaaaaa 
aatgaagttt taaatcaatc taaagtatat 
gcttaatcag tgaggcacct atctcagcga 
gactccccgt cgtgtagata actacgatac 
caatgatacc gcgagaccca cgctcaccgg 
ccggaagggc cgagcgcaga agtggtcctg 
attgttgccg ggaagctaga gtaagtagtt 
ccattgctac aggcatcgtg gtgtcacgct 
gttcccaacg atcaaggcga gttacatgat 
ccttcggtcc tccgatcgtt gtcagaagta 
tggcagcact gcafcaattct cttactgtca 
gtgagtactc aaccaagtca ttctgagaat 
cggcgtcaat acgggataat accgcgccac 
gaaaacgttc ttcggggcga aaactctcaa 



tttagtgctt tacggcacct cgaccccaaa 3540 
gggccatcgc cctgatagac ggtttttcgc 3600 
agtggactct tgttccaaac tggaacaaca 3660 
ttataaggga ttttggggat ttcggcctat 3720 
tttaacgcga attaattctg tggaatgtgt 3780 
ccccaggcag gcagaagtat gcaaagcatg 384 0 
aagtccccag gctccccagc aggcagaagt 3900 
accatagtcc cgcccctaac tccgcccatc 3960 
tctccgcccc atggctgact aatttttttt 402 0 
tctgagctat tccagaagta gtgaggaggc 4080 
ctcccgggag cttgtatatc cattttcgga 4140 
cgcatgattg aacaagatgg attgcacgca 42 0 0 
ttcggctatg actgggcaca acagacaatc 42 60 
tcagcgcagg ggcgcccggt tctttttgtc 432 0 
ctgcaggacg aggcagcgcg gctatcgtgg 43 80 
gtgctcgacg ttgtcactga agcgggaagg 444 0 
caggatctcc tgtcatctca ccttgctcct 4500 
atgcggcggc tgcatacgct tgatccggct 4560 
cgcatcgagc gagcacgtac tcggatggaa 4620 
gaagagcatc aggggctcgc gccagccgaa 4680 
gacggcgagg atctcgtcgt gacccatggc 474 0 
aatggccgct tttctggatt catcgactgt 4800 
gacatagcgt tggctacccg tgatattgct 4860 
ttcctcgtgc tttacggtat cgccgctccc 4920 
cttgacgagt tcttctgagc gggactctgg 4980 
acctgccatc acgagatttc gattccaccg 5040 
tcgttttccg ggacgccggc tggatgatcc 5100 
tcgcccaccc caacttgttt attgcagctt 5160 
caaatttcac aaataaagca tttttttcac 5220 
tcaatgtatc ttatcatgtc tgtataccgt 5280 
ggtcatagct gtttcctgtg tgaaattgtt 5340 
ccggaagcat aaagtgtaaa gcctggggtg 5400 
cgttgcgctc actgcccgct ttccagtcgg 5460 
tcggccaacg cgcggggaga ggcggtttgc 5520 
ctgactcgct gcgctcggtc gttcggctgc 5580 
taatacggtt atccacagaa tcaggggata 5640 
agcaaaaggc caggaaccgt aaaaaggccg 5700 
cccctgacga gcatcacaaa aatcgacgct 5760 
tataaagata ccaggcgttt ccccctggaa 5820 
tgccgcttac cggatacctg tccgcctttc 5880 
gctcacgctg taggtatctc agttcggtgt 5940 
acgaaccccc cgttcagccc gaccgctgcg 6000 
acccggtaag acacgactta tcgccactgg 6060 
cgaggtatgt aggcggtgct acagagttct 6120 
gaaggacagt atttggtatc tgcgctctgc 6180 
gtagctcttg atccggcaaa caaaccaccg 6240 
agcagattac gcgcagaaaa aaaggatctc 6300 
ctgacgctca gtggaacgaa aactcacgtt 6360 
ggatcttcac ctagatcctt ttaaattaaa 6420 
atgagtaaac ttggtctgac agttaccaat 648 0 
tctgtctatt tcgttcatcc atagttgcct 6540 
gggagggctt accatctggc cccagtgctg 6600 
ctccagattt atcagcaata aaccagccag 6660 
caactttatc cgcctccatc cagtctatta 6720 
cgccagttaa tagtttgcgc aacgttgttg 6780 
cgtcgtttgg tatggcttca ttcagctccg 6840 
cccccatgtt gtgcaaaaaa gcggttagct 6900 
agttggccgc agtgttatca ctcatggtta 6960 
tgccatccgt aagatgcttt tctgtgactg 7020 
agtgtatgcg gcgaccgagt tgctcttgcc 7080 
atagcagaac tttaaaagtg ctcatcattg 7140 
ggatcttacc gctgttgaga tccagttcga 7200 
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tgtaacccac tcgtgcaccc aactgatctt cagcatcttt tactttcacc agcgtttctg 7260 
ggtgagcaaa aacaggaagg caaaatgccg caaaaaaggg aataagggcg acacggaaat 732 0 
gttgaatact catactcttc ctttttcaat attattgaag catttatcag ggttattgtc 73 80 
tcatgagcgg atacatattt gaatgtattt agaaaaataa acaaataggg gttccgcgca 744 0 
catttccccg aaaagtgcca cctgacgtc 7469 

<210> 9 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 9 

tgcttaagcg gccgcgaagg agaagtcc 2 8 

<210> 10 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 10 

ccgagctagc gactgaaaat gag 23 



<210> 11 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 11 

cctctcgaga gacagcaaga cac 23 



<210> 12 
<211> 11152 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid 
<400> 12 

aagcttgggc agaaatggtt gaactcccga gagtgtccta cacctagggg agaagcagcc 60 
aaggggttgt ttcccaccaa ggacgacccg tctgcgcaca aacggatgag cccatcagac 120 
aaagacatat tcattctctg ctgcaaactt ggcatagctc tgctttgcct ggggctattg 180 
99gga.agttg cggttcgtgc tcgcagggct ctcacccttg actcttttaa tagctcttct 240 
gtgcaagatt acaatctaaa caattcggag aactcgacct tcctcctgag gcaaggacca 300 
cagccaactt cctcttacaa gccgcatcga ttttgtcctt cagaaataga aataagaatg 360 
cttgctaaaa attatatttt taccaataag accaatccaa taggtagatt attagttact 420 
atgttaagaa atgaatcatt atcttttagt actattttta ctcaaattca gaagttagaa 480 
atgggaatag aaaatagaaa gagacgctca acctcaattg aagaacaggt gcaaggacta 540 
ttgaccacag gcctagaagt aaaaaaggga aaaaagagtg tttttgtcaa aataggagac 600 
a ggtggtggc aaccagggac ttatagggga ccttacatct acagaccaac agatgccccc 660 
ttaccatata caggaagata tgacttaaat tgggataggt gggttacagt caatggctat 720 
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aaagtgttat atagatccct 
tgtatgttgt ctcaagaaga 
ctaggaacag gaatgcactt 
ggactaatag aacattattc 
ttggcccaac cttgcggttc 
aacgaggatg tgagacaagt 
gctctgagtg ttctattttc 
ttatgtaaac caagatataa 
cattcacctc ttgtgtgttt 
tcactttcca gagggtcccc 
gctggcgccc gaacagggac 
tgtttgtctt gtattgtctc 
ccatagggac caagctagcg 
accgaagaaa tggccgccag 
cttccacctc ctagccattt 
acggcccccg aagatcccaa 
ttggcggtgc aggaagggat 
ccgcctcacc tttcccggca 
atgccaaacc ttgtaccgga 
agtgacgacg aggatgaaga 
cacggttgca ggtcttgtca 
tcgctttgct atatgaggac 
gtgggtgata gagtggtggg 
ggtttaaaga attttgtatt 
agcccgagcc agaaccggag 
tcctgagacg cccgacatca 
actccggtcc ttctaacaca 
aaccagttgc cgtgagagtt 
ttaacgagcc tgggcaacct 
acctgtgatt gcgtgtgtgg 
taaagggtga gataatgttt 
atataatgcg ccgtgggcta 
tggaagattt ttctgctgtg 
1 1 1 ggagg 1 1 t c t gt ggggc 
acaagtggga atttgaagag 
tgggtcacca ggcgcttttc 
ggcgcgctgc ggctgctgtt 
cccatctgag cggggggtac 
tgagacacaa gaatcgcctg 
aggagcagca gcagcagcag 
acccgagagc cggcctggac 
agaactgaga cgcattttga 
ggagcggggg gcttgtgagg 
cagacaccgt cctgagtgta 
tgatctgctg gcgcagaagt 
ggatgatttt gaggaggcta 
gtacaagatc agcaaacttg 
cgaggtggag atagatacgg 
gccgggggtg cttggcatgg 
ttttagcggt acggttttcc 
tgggtttaac aatacctgtg 
ttactgctgc tggaaggggg 
cctctttgaa aggtgtacct 
tgtggcctcc gactgtggtt 
catggtatgt ggcaactgcg 
ctgtcacctg ctgaagacca 
tgagcataac atactgaccc 
accttaccaa tgcaatttga 
ggtgaacctg aacggggtgt 
tgagacccgc accaggtgca 
tgtgatgctg gatgtgaccg 
cgctgagttt ggctctagcg 



cccttttcgt gaaagactcg 
aaaagacgac atgaaacaac 
ttggggaaag attttccata 
tgcaaaaact catggcatga 
ccagggctta agtaagtttt 
ggtttcctga cttggtttgg 
ctatgttctt ttggaattta 
aagagtgctg attttttgag 
gtgtctgttc gccatcccgt 
ccgcagaccc cggcgaccct 
cctcggataa gtgacccttg 
tttcttgtct ggctatcatc 
actgaaaatg agacatatta 
tcttttggac cagctgatcg 
tgaaccacct acccttcacg 
cgaggaggcg gtttcgcaga 
tgacttactc acttttccgc 
gcccgagcag ccggagcaga 
ggtgatcgat cttacctgcc 
999tgaggag tttgtgttag 
ttatcaccgg aggaatacgg 
ctgtggcatg tttgtctaca 
tttggtgtgg taattttttt 
gtgatttttt taaaaggtcc 
cctgcaagac ctacccgccg 
cctgtgtcta gagaatgcaa 
cctcctgaga tacacccggt 
ggtgggcgtc gccaggctgt 
ttggacttga gctgtaaacg 
ttaacgcctt tgtttgctga 
aacttgcatg gcgtgttaaa 
atcttggtta catctgacct 
cgtaacttgc tggaacagag 
tcatcccagg caaagttagt 
cttttgaaat cctgtggtga 
caagagaagg tcatcaagac 
gcttttttga gttttataaa 
ctgctggatt ttctggccat 
ctactgttgt cttccgtccg 
gaggaagcca ggcggcggcg 
cctcgggaat gaatgttgta 
caattacaga ggatgggcag 
ctacagagga ggctaggaat 
ttacttttca acagatcaag 
attccataga gcagctgacc 
ttagggtata tgcaaaggtg 
taaatatcag gaattgttgc 
aggatagggt ggcctttaga 
acggggtggt tattatgaat 
tggccaatac caaccttatc 
tggaagcctg gaccgatgta 
tggtgtgtcg ccccaaaagc 
tgggtatcct gtctgagggt 
gcttcatgct agtgaaaagc 
aggacagggc ctctcagatg 
ttcacgtagc cagccactct 
gctgttcctt gcatttgggt 
gtcacactaa gatattgctt 
ttgacatgac catgaagatc 
gaccctgcga gtgtggcggt 
aggagctgag gcccgatcac 
atgaagatac agattgaggt 



ccagagctag acctccttgg 780 
aggtacatga ttatatttat 840 
ccaaggaggg gacagtggct 900 
gttattatga atagccttta 960 
tggttacaaa ctgttcttaa 1020 
tatcaaaggt tctgatctga 10 80 
tccaaatctt atgtaaatgc 1140 
taaacttgca acagtcctaa 12 00 
ctccgctcgt cacttatcct 1260 
caggtcggcc gactgcggca 132 0 
tctctatttc tactatttgg 1380 
acaagagcgg aacggactca 1440 
tctgccacgg aggtgttatt 1500 
aagaggtact ggctgataat 1560 
aactgtatga tttagacgtg 162 0 
tttttcccga ctctgtaatg 1680 
cggcgcccgg ttctccggag 1740 
gagccttggg tccggtttct 1800 
acgaggctgg ctttccaccc 1860 
attatgtgga gcaccccggg 192 0 
gggacccaga tattatgtgt 1980 
gtaagtgaaa attatgggca 2 040 
tttaattttt acagttttgt 2100 
tgtgtctgaa cctgagcctg 2160 
tcctaaaatg gcgcctgcta 222 0 
tagtagtacg gatagctgtg 22 80 
ggtcccgctg tgccccatta 2340 
ggaatgtatc gaggacttgc 2400 
ccccaggcca taaggtgtaa 2460 
atgagttgat gtaagtttaa 2 52 0 
tggggcgggg cttaaagggt 2 580 
catggaggct tgggagtgtt 2 640 
ctctaacagt acctcttggt 270 0 
ctgcagaatt aaggaggatt 2 760 
gctgtttgat tctttgaatc 2820 
tttggatttt tccacaccgg 2 8 80 
ggataaatgg agcgaagaaa 2 940 
gcatctgtgg agagcggttg 3 0 00 
cccggcgata ataccgacgg 3 060 
gcaggagcag agcccatgga 312 0 
caggtggctg aactgtatcc 3180 
gggctaaagg gggtaaagag 3240 
ctagctttta gcttaatgac 3300 
gataattgcg ctaatgagct 3360 
acttactggc tgcagccagg 342 0 
gcacttaggc cagattgcaa 3480 
tacatttctg ggaacggggc 3540 
tgtagcatga taaatatgtg 3600 
gtaaggttta ctggccccaa 3 660 
ctacacggtg taagcttcta 3720 
agggttcggg gctgtgcctt 3780 
agggcttcaa ttaagaaatg 3 840 
aactccaggg tgcgccacaa 3900 
gtggctgtga ttaagcataa 3960 
ctgacctgct cggacggcaa 4020 
cgcaaggcct ggccagtgtt 40 80 
aacaggaggg gggtgttcct 4140 
gagcccgaga gcatgtccaa 4200 
tggaaggtgc tgaggtacga 4260 
aaacatatta ggaaccagcc 4320 
ttggtgctgg cctgcacccg 43 80 
actgaaatgt gtgggcgtgg 4440 
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cttaagggtg ggaaagaata tataaggtgg gggtcttatg tagttttgta tctgttttgc 4500 

agcagccgcc gccgccatga gcaccaactc gtttgatgga agcattgtga gctcatattt 4560 

gacaacgcgc atgcccccat gggccggggt gcgtcagaat gtgatgggct ccagcattga 462 0 

tggtcgcccc gtcctgcccg caaactctac taccttgacc tacgagaccg tgtctggaac 4680 

gccgttggag actgcagcct ccgccgccgc ttcagccgct gcagccaccg cccgcgggat 474 0 

tgtgactgac tttgctttcc tgagcccgct tgcaagcagt gcagcttccc gttcatccgc 4 800 

ccgcgatgac aagttgacgg ctcttttggc acaattggat tctttgaccc gggaacttaa 4860 

tgtcgtttct cagcagctgt tggatctgcg ccagcaggtt tctgccctga aggcttcctc 4920 

ccctcccaat gcggtttaaa acataaataa aaaaccagac tctgtttgga tttggatcaa 4980 

gcaagtgtct tgctgtctct cgagggatct ttgtgaagga accttacttc tgtggtgtga 5040 

cataattgga caaactacct acagagattt aaagctctaa ggtaaatata aaatttttaa 510 0 

gtgtataatg tgttaaacta ctgattctaa ttgtttgtgt attttagatt ccaacctatg 5160 

gaactgatga atgggagcag tggtggaatg cctttaatga ggaaaacctg ttttgctcag 522 0 

aagaaatgcc atctagtgat gatgaggcta ctgctgactc tcaacattct actcctccaa 5280 

aaaagaagag aaaggtagaa gaccccaagg actttccttc agaattgcta agttttttga 5340 

gtcatgctgt gtttagtaat agaactcttg cttgctttgc tatttacacc acaaaggaaa 540 0 

aagctgcact gctatacaag aaaattatgg aaaaatattc tgtaaccttt ataagtaggc 5460 

ataacagtta taatcataac atactgtttt ttcttactcc acacaggcat agagtgtctg 5520 

ctattaataa ctatgctcaa aaattgtgta cctttagctt tttaatttgt aaaggggtta 558 0 

ataaggaata tttgatgtat agtgccttga ctagagatca taatcagcca taccacattt 5640 

gtagaggttt tacttgcttt aaaaaacctc ccacacctcc ccctgaacct gaaacataaa 5700 

atgaatgcaa ttgttgttgt taacttgttt attgcagctt ataatggtta caaataaagc 5760 

aatagcatca caaatttcac aaataaagca tttttttcac tgcattctag ttgtggtttg 5820 

tccaaactca tcaatgtatc ttatcatgtc tggatccggc tgtggaatgt gtgtcagtta 5880 

gggtgtggaa agtccccagg ctccccagca ggcagaagta tgcaaagcat gcatctcaat 5940 

tagtcagcaa ccaggtgtgg aaagtcccca ggctccccag caggcagaag tatgcaaagc 6000 

atgcatctca attagtcagc aaccatagtc ccgcccctaa ctccgcccat cccgccccta 6060 

actccgccca gttccgccca ttctccgccc catggctgac taattttttt tatttatgca 6120 

gaggccgagg ccgcctcggc ctctgagcta ttccagaagt agtgaggagg cttttttgga 6180 

ggcctaggct tttgcaaaaa gcttggacac aagacaggct tgcgagatat gtttgagaat 6240 

accactttat cccgcgtcag ggagaggcag tgcgtaaaaa gacgcggact catgtgaaat 6300 

actggttttt agtgcgccag atctctataa tctcgcgcaa cctattttcc cctcgaacac 6360 

tttttaagcc gtagataaac aggctgggac acttcacatg agcgaaaaat acatcgtcac 6420 

ctgggacatg ttgcagatcc atgcacgtaa actcgcaagc cgactgatgc cttctgaaca 6480 

atggaaaggc attattgccg taagccgtgg cggtctggta ccgggtgcgt tactggcgcg 654 0 

tgaactgggt attcgtcatg tcgataccgt ttgtatttcc agctacgatc acgacaacca 6600 

gcgcgagctt aaagtgctga aacgcgcaga aggcgatggc gaaggcttca tcgttattga 6660 

tgacctggtg gataccggtg gtactgcggt tgcgattcgt gaaatgtatc caaaagcgca 672 0 

ctttgtcacc atcttcgcaa aaccggctgg tcgtccgctg gttgatgact atgttgttga 6780 

tatcccgcaa gatacctgga ttgaacagcc gtgggatatg ggcgtcgtat tcgtcccgcc 6 840 

aatctccggt cgctaatctt ttcaacgcct ggcactgccg ggcgttgttc tttttaactt 6900 

caggcgggtt acaatagttt ccagtaagta ttctggaggc tgcatccatg acacaggcaa 6960 

acctgagcga aaccctgttc aaaccccgct ttaaacatcc tgaaacctcg acgctagtcc 702 0 

gccgctttaa tcacggcgca caaccgcctg tgcagtcggc ccttgatggt aaaaccatcc 7080 

ctcactggta tcgcatgatt aaccgtctga tgtggatctg gcgcggcatt gacccacgcg 7140 

aaatcctcga cgtccaggca cgtattgtga tgagcgatgc cgaacgtacc gacgatgatt 7200 

tatacgatac ggtgattggc taccgtggcg gcaactggat ttatgagtgg gccccggatc 7260 

tttgtgaagg aaccttactt ctgtggtgtg acataattgg acaaactacc tacagagatt 7320 

taaagctcta aggtaaatat aaaattttta agtgtataat gtgttaaact actgattcta 7380 

attgtttgtg tattttagat tccaacctat ggaactgatg aatgggagca gtggtggaat 7440 

gcctttaatg aggaaaacct gttttgctca gaagaaatgc catctagtga tgatgaggct 7500 

actgctgact ctcaacattc tactcctcca aaaaagaaga gaaaggtaga agaccccaag 7560 

gactttcctt cagaattgct aagttttttg agtcatgctg tgtttagtaa tagaactctt 7620 

gcttgctttg ctatttacac cacaaaggaa aaagctgcac tgctatacaa gaaaattatg 7680 

gaaaaatatt ctgtaacctt tataagtagg cataacagtt ataatcataa catactgttt 7740 

tttcttactc cacacaggca tagagtgtct gctattaata actatgctca aaaattgtgt 7800 
acctttagct ttttaatttg taaaggggtt aataaggaat atttgatgta tagtgccttg 7860 

actagagatc ataatcagcc ataccacatt tgtagaggtt ttacttgctt taaaaaacct 7920 

cccacacctc cccctgaacc tgaaacataa aatgaatgca attgttgttg ttaacttgtt 7980 

tattgcagct tataatggtt acaaataaag caatagcatc acaaatttca caaataaagc 8040 

atttttttca ctgcattcta gttgtggttt gtccaaactc atcaatgtat cttatcatgt 8100 

ctggatcccc aggaagctcc tctgtgtcct cataaaccct aacctcctct acttgagagg 8160 
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acattccaat cataggctgc ccatccaccc tctgtgtcct cctgttaatt aggtcactta 8220 
acaaaaagga aattgggtag gggtttttca cagaccgctt tctaagggta attttaaaat 82 80 
atctgggaag tcccttccac tgctgtgttc cagaagtgtt ggtaaacagc ccacaaatgt 8340 
caacagcaga aacatacaag ctgtcagctt tgcacaaggg cccaacaccc tgctcatcaa 8400 
gaagcactgt ggttgctgtg ttagtaatgt gcaaaacagg aggcacattt tccccacctg 8460 
tgtaggttcc aaaatatcta gtgttttcat ttttacttgg atcaggaacc cagcactcca 8520 
ctggataagc attatcctta tccaaaacag ccttgtggtc agtgttcatc tgctgactgt 8580 
caactgtagc attttttggg gttacagttt gagcaggata tttggtcctg tagtttgcta 8640 
acacaccctg cagctccaaa ggttccccac caacagcaaa aaaatgaaaa tttgaccctt 8700 
gaatgggttt tccagcacca ttttcatgag ttttttgtgt ccctgaatgc aagtttaaca 8760 
tagcagttac- cccaataacc tcagttttaa cagtaacagc ttcccacatc aaaatatttc 8 82 0 
cacaggttaa gtcctcattt aaattaggca aaggaattct tgaagacgaa agggcctcgt 8 880 
gatacgccta tttttatagg ttaatgtcat gataataatg gtttcttaga cgtcaggtgg 8940 
cacttttcgg ggaaatgtgc gcggaacccc tatttgttta tttttctaaa tacattcaaa 9000 
tatgtatccg ctcatgagac aataaccctg ataaatgctt caataatatt gaaaaaggaa 9060 
gagtatgagt attcaacatt tccgtgtcgc ccttattccc ttttttgcgg cattttgcct 9120 
tcctgttttt gctcacccag aaacgctggt gaaagtaaaa gatgctgaag atcagttggg 9180 
tgcacgagtg ggttacatcg aactggatct caacagcggt aagatccttg agagttttcg 9240 
ccccgaagaa cgttttccaa tgatgagcac ttttaaagtt ctgctatgtg gcgcggtatt 9300 
atcccgtgtt gacgccgggc aagagcaact cggtcgccgc atacactatt ctcagaatga 9360 
cttggttgag tactcaccag tcacagaaaa gcatcttacg gatggcatga cagtaagaga 9420 
attatgcagt gctgccataa ccatgagtga taacactgcg gccaacttac ttctgacaac 9480 
gatcggagga ccgaaggagc taaccgcttt tttgcacaac atgggggatc atgtaactcg 9540 
ccttgatcgt tgggaaccgg agctgaatga agccatacca aacgacgagc gtgacaccac 9600 
gatgcctgca gcaatggcaa caacgttgcg caaactatta actggcgaac tacttactct 9660 
agcttcccgg caacaattaa tagactggat ggaggcggat aaagttgcag gaccacttct 972 0 
gcgctcggcc cttccggctg gctggtttat tgctgataaa tctggagccg gtgagcgtgg 9780 
gtctcgcggt atcattgcag cactggggcc agatggtaag ccctcccgta tcgtagttat 9840 
ctacacgacg gggagtcagg caactatgga tgaacgaaat agacagatcg ctgagatagg 9900 
tgcctcactg attaagcatt ggtaactgtc agaccaagtt tactcatata tactttagat 9960 
tgatttaaaa cttcattttt aatttaaaag gatctaggtg aagatccttt ttgataatct 10020 
catgaccaaa atcccttaac gtgagttttc gttccactga gcgtcagacc ccgtagaaaa 100 80 
gatcaaagga tcttcttgag atcctttttt tctgcgcgta atctgctgct tgcaaacaaa 10140 
aaaaccaccg ctaccagcgg tggtttgttt gccggatcaa gagctaccaa ctctttttcc 10200 
gaaggtaact ggcttcagca gagcgcagat accaaatact gtccttctag tgtagccgta 10260 
gttaggccac cacttcaaga actctgtagc accgcctaca tacctcgctc tgctaatcct 10320 
gttaccagtg gctgctgcca gtggcgataa gtcgtgtctt accgggttgg actcaagacg 10380 
atagttaccg gataaggcgc agcggtcggg ctgaacgggg ggttcgtgca cacagcccag 10440 
cttggagcga acgacctaca ccgaactgag atacctacag cgtgagctat gagaaagcgc 10500 
cacgcttccc gaagggagaa aggcggacag gtatccggta agcggcaggg tcggaacagg 10560 
agagcgcacg agggagcttc cagggggaaa cgcctggtat ctttatagtc ctgtcgggtt 10620 
tcgccacctc tgacttgagc gtcgattttt gtgatgctcg tcaggggggc ggagcctatg 10680 
gaaaaacgcc agcaacgcgg cctttttacg gttcctggcc ttttgctggc cttttgctca 10740 
catgttcttt cctgcgttat cccctgattc tgtggataac cgtattaccg cctttgagtg 10800 
agctgatacc gctcgccgca gccgaacgac cgagcgcagc gagtcagtga gcgaggaagc 10860 
ggaagagcgc ctgatgcggt attttctcct tacgcatctg tgcggtattt cacaccgcat 10920 
atggtgcact ctcagtacaa tctgctctga tgccgcatag ttaagccagt atacactccg 10980 
ctatcgctac gtgactgggt catggctgcg ccccgacacc cgccaacacc cgctgacgcg 11040 
ccctgacggg cttgtctgct cccggcatcc gcttacagac aagctgtgac cgtctccggg 11100 
agctgcatgt gtcagaggtt ttcaccgtca tcaccgaaac gcgcgaggca gc 11152 

<210> 13 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<400> 13 

gacggatcgg gagatctcc 



19 
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<210> 14 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 14 

ccgcctcaga agccatagag cc 22 



<210> 15 
<211> 14455 
<212> DNA 
- <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: plasmid 
<400> 15 

aagcttgggc agaaatggtt gaactcccga gagtgtccta cacctagggg agaagcagcc 60 

aaggggttgt ttcccaccaa ggacgacccg tctgcgcaca aacggatgag cccatcagac 120 

aaagacatat tcattctctg ctgcaaactt ggcatagctc tgctttgcct ggggctattg 180 

ggggaagttg cggttcgtgc tcgcagggct ctcacccttg actcttttaa tagctcttct 24 0 

gtgcaagatt acaatctaaa caattcggag aactcgacct tcctcctgag gcaaggacca 30 0 

cagccaactt cctcttacaa gccgcatcga ttttgtcctt cagaaataga aataagaatg 360 

cttgctaaaa attatatttt taccaataag accaatccaa taggtagatt attagttact 420 

atgttaagaa atgaatcatt atcttttagt actattttta ctcaaattca gaagttagaa 480 

atgggaatag aaaatagaaa gagacgctca acctcaattg aagaacaggt gcaaggacta 540 

ttgaccacag gcctagaagt aaaaaaggga aaaaagagtg tttttgtcaa aataggagac 600 

aggtggtggc aaccagggac ttatagggga ccttacatct acagaccaac agatgccccc 660 

ttaccatata caggaagata tgacttaaat tgggataggt gggttacagt caatggctat 72 0 

aaagtgttat atagatccct cccttttcgt gaaagactcg ccagagctag acctccttgg 78 0 

tgtatgttgt ctcaagaaga aaaagacgac atgaaacaac aggtacatga ttatatttat 84 0 

ctaggaacag gaatgcactt ttggggaaag attttccata ccaaggaggg gacagtggct 900 

ggactaatag aacattattc tgcaaaaact catggcatga gttattatga atagccttta 960 

ttggcccaac cttgcggttc ccagggctta agtaagtttt tggttacaaa ctgttcttaa 1020 

aacgaggatg tgagacaagt ggtttcctga cttggtttgg tatcaaaggt tctgatctga 10 80 

gctctgagtg ttctattttc ctatgttctt ttggaattta tccaaatctt atgtaaatgc 1140 

ttatgtaaac caagatataa aagagtgctg attttttgag taaacttgca acagtcctaa 12 00 

cattcacctc ttgtgtgttt gtgtctgttc gccatcccgt ctccgctcgt cacttatcct 1260 

tcactttcca gagggtcccc ccgcagaccc cggcgaccct caggtcggcc gactgcggca 1320 

gctggcgccc gaacagggac cctcggataa gtgacccttg tctctatttc tactatttgg 13 80 

tgtttgtctt gtattgtctc tttcttgtct ggctatcatc acaagagcgg aacggactca 1440 

ccatagggac caagctagcg actgaaaatg agacatatta tctgccacgg aggtgttatt 1500 

accgaagaaa tggccgccag tcttttggac cagctgatcg aagaggtact ggctgataat 1560 

cttccacctc ctagccattt tgaaccacct acccttcacg aactgtatga tttagacgtg 1620 

acggcccccg aagatcccaa cgaggaggcg gtttcgcaga tttttcccga ctctgtaatg 16 80 

ttggcggtgc aggaagggat tgacttactc acttttccgc cggcgcccgg ttctccggag 1740 

ccgcctcacc tttcccggca gcccgagcag ccggagcaga gagccttggg tccggtttct 1800 

atgccaaacc ttgtaccgga ggtgatcgat cttacctgcc acgaggctgg ctttccaccc 1860 

agtgacgacg aggatgaaga gggtgaggag tttgtgttag attatgtgga gcaccccggg 1920 

cacggttgca ggtcttgtca ttatcaccgg aggaatacgg gggacccaga tattatgtgt 1980 

tcgctttgct atatgaggac ctgtggcatg tttgtctaca gtaagtgaaa attatgggca 2040 

gtgggtgata gagtggtggg tttggtgtgg taattttttt tttaattttt acagttttgt 2100 

ggtttaaaga attttgtatt gtgatttttt taaaaggtcc tgtgtctgaa cctgagcctg 2160 

agcccgagcc agaaccggag cctgcaagac ctacccgccg tcctaaaatg gcgcctgcta 2220 

tcctgagacg cccgacatca cctgtgtcta gagaatgcaa tagtagtacg gatagctgtg 22 80 

actccggtcc ttctaacaca cctcctgaga tacacccggt ggtcccgctg tgccccatta 2340 

aaccagttgc cgtgagagtt ggtgggcgtc gccaggctgt ggaatgtatc gaggacttgc 2400 

ttaacgagcc tgggcaacct ttggacttga gctgtaaacg ccccaggcca taaggtgtaa 2460 
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acctgtgatt 
taaagggtga 
atataatgcg 
tggaagattt 
tttggaggtt 
acaagtggga 
tgggtcacca 
ggcgcgctgc 
cccatctgag 
tgagacacaa 
aggagcagca 
acccgagagc 
agaactgaga 
ggagcggggg 
cagacaccgt 
tgatctgctg 
ggatgatttt 
gtacaagatc 
cgaggtggag 
gccgggggtg 
ttttagcggt 
tgggtttaac 
ttactgctgc 
cctctttgaa 
tgtggcctcc 
catggtatgt 
ctgtcacctg 
tgagcataac 
accttaccaa 
ggtgaacctg 
tgagacccgc 
tgtgatgctg 
cgctgagttt 
cttaagggtg 
. agcagccgcc 
gacaacgcgc 
tggtcgcccc 
gccgttggag 
tgtgactgac 
ccgcgatgac 
tgtcgtttct 
ccctcccaat 
gcaagtgtct 
cataattgga 
gtgtataatg 
gaactgatga 
aagaaatgcc 
aaaagaagag 
gtcatgctgt 
aagctgcact 
ataacagtta 
ctattaataa 
ataaggaata 
gtagaggttt 
atgaatgcaa 
aatagcatca 
tccaaactca 

gggtgtggaa 

tagtcagcaa 
atgcatctca 
actccgccca 
gaggccgagg 



gcgtgtgtgg 
gataatgttt 
ccgtgggcta 
ttctgctgtg 
tctgtggggc 
atttgaagag 
ggcgcttttc 
ggctgctgtt 

cggggggtac 

gaatcgcctg 
gcagcagcag 
cggcctggac 
cgcattttga 
gcttgtgagg 
cctgagtgta 
gcgcagaagt 
gaggaggcta 
agcaaacttg 
atagatacgg 
cttggcatgg 
acggttttcc 
aatacctgtg 
tggaaggggg 
aggtgtacct 
gactgtggtt 
ggcaactgcg 
ctgaagacca 
atactgaccc 
tgcaatttga 
aacggggtgt 
accaggtgca 
gatgtgaccg 
ggctctagcg 
ggaaagaata 
gccgccatga 
atgcccccat 
gtcctgcccg 
actgcagcct 
tttgctttcc 
aagttgacgg 
cagcagctgt 
gcggtttaaa 
tgctgtctct 
caaactacct 
tgttaaacta 
atgggagcag 
atctagtgat 
aaaggtagaa 
gtttagtaat 
gctatacaag 
taatcataac 
ctatgctcaa 
tttgatgtat 
tacttgcttt 
ttgttgttgt 
caaatttcac 
tcaatgtatc 
agtccccagg 
ccaggtgtgg 
attagtcagc 
gttccgccca 
ccgcctcggc 



ttaacgcctt 
aacttgcatg 
atcttggtta 
cgtaacttgc 
tcatcccagg 
cttttgaaat 
caagagaagg 
gcttttttga 
ctgctggatt 
ctactgttgt 
gaggaagcca 
cctcgggaat 
caattacaga 
ctacagagga 
ttacttttca 
attccataga 
ttagggtata 
taaatatcag 
aggatagggt 

ac ggggtggt 

tggccaatac 
tggaagcctg 

tggtgtgtcg 

tgggtatcct 
gcttcatgct 
aggacagggc 
ttcacgtagc 
gctgttcctt 
gtcacactaa 
ttgacatgac 
gaccctgcga 
aggagctgag 
atgaagatac 
tataaggtgg 
gcaccaactc 
gggccggggt 
caaactctac 
ccgccgccgc 
tgagcccgct 
ctcttttggc 
tggatctgcg 
acataaataa 
cgagggatct 
acagagattt 
ctgattctaa 
tggtggaatg 
gatgaggcta 
gaccccaagg 
agaactcttg 
aaaattatgg 
atactgtttt 
aaattgtgta 
agtgccttga 
aaaaaacctc 
taacttgttt 
aaataaagca 
ttatcatgtc 
ctccccagca 
aaagtcccca 
aaccatagtc 
ttctccgccc 
ctctgagcta 



tgtttgctga 
gcgtgttaaa 
catctgacct 
tggaacagag 
caaagttagt 
cctgtggtga 
tcatcaagac 
gttttataaa 
ttctggccat 
cttccgtccg 
ggcggcggcg 
gaatgttgta 
ggatgggcag 
ggctaggaat 
acagatcaag 
gcagctgacc 
tgcaaaggtg 
gaattgttgc 
ggcctttaga 
tattatgaat 
caaccttatc 
gaccgatgta 
ccccaaaagc 
gtctgagggt 
agtgaaaagc 
ctctcagatg 
cagccactct 
gcatttgggt 
gatattgctt 
catgaagatc 
gtgtggcggt 
gcccgatcac 
agattgaggt 
gggtcttatg 
gtttgatgga 
gcgtcagaat 
taccttgacc 
ttcagccgct 
tgcaagcagt 
acaattggat 
ccagcaggtt 
aaaaccagac 
ttgtgaagga 
aaagctctaa 
ttgtttgtgt 
cctttaatga 
ctgctgactc 
actttccttc 
cttgctttgc 
aaaaatattc 
ttcttactcc 
cctttagctt 
ctagagatca 
ccacacctcc 
attgcagctt 
tttttttcac 
tggatccggc 
ggcagaagta 
ggctccccag 
ccgcccctaa 
catggctgac 
ttccagaagt 



atgagttgat 
tggggcgggg 
catggaggct 
ctctaacagt 
ctgcagaatt 
gctgtttgat 
tttggatttt 
ggataaatgg 
gcatctgtgg 
cccggcgata 
gcaggagcag 
caggtggctg 
gggctaaagg 
ctagctttta 
gataattgcg 
acttactggc 
gcacttaggc 
tacatttctg 
tgtagcatga 
gtaaggttta 
ctacacggtg 
agggttcggg 
agggcttcaa 
aactccaggg 
gtggctgtga 
ctgacctgct 
cgcaaggcct 
aacaggaggg 
gagcccgaga 
tggaaggtgc 
aaacatatta 
ttggtgctgg 
actgaaatgt 
tagttttgta 
agcattgtga 
gtgatgggct 
tacgagaccg 
gcagccaccg 
gcagcttccc 
tctttgaccc 
tctgccctga 
tctgtttgga 
accttacttc 
ggtaaatata 
attttagatt 
ggaaaacctg 
tcaacattct 
agaattgcta 
tatttacacc 
tgtaaccttt 
acacaggcat 
tttaatttgt 
taatcagcca 
ccctgaacct 
ataatggtta 
tgcattctag 
tgtggaatgt 
tgcaaagcat 
caggcagaag 
ctccgcccat 
taattttttt 
agtgaggagg 



gtaagtttaa 
cttaaagggt 
tgggagtgtt 
acctcttggt 
aaggaggatt 
tctttgaatc 
tccacaccgg 
agcgaagaaa 
agagcggttg 
ataccgacgg 
agcccatgga 
aactgtatcc 
gggtaaagag 
gcttaatgac 
ctaatgagct 
tgcagccagg 
cagattgcaa 
ggaacggggc 
taaatatgtg 
ctggccccaa 
taagcttcta 
gctgtgcctt 
ttaagaaatg 
tgcgccacaa 
ttaagcataa 
cggacggcaa 
ggccagtgtt 
gggtgttcct 
gcatgtccaa 
tgaggtacga 
ggaaccagcc 
cctgcacccg 
gtgggcgtgg 
tctgttttgc 
gct.catattt 
ccagcattga 
tgtctggaac 
cccgcgggat 
gttcatccgc 
gggaacttaa 
aggcttcctc 
tttggatcaa 
tgtggtgtga 
aaatttttaa 
ccaacctatg 
ttttgctcag 
actcctccaa 
agttttttga 
acaaaggaaa 
ataagtaggc 
agagtgtctg 
aaaggggtta 
taccacattt 
gaaacataaa 
caaataaagc 
ttgtggtttg 
gtgtcagtta 
gcatctcaat 
tatgcaaagc 
cccgccccta 
tatttatgca 
cttttttgga 



2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
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ggcctaggct tttgcaaaaa gcttggacac aagacaggct tgcgagatat gtttgagaat 6240 
accactttat cccgcgtcag ggagaggcag tgcgtaaaaa gacgcggact catgtgaaat 63 0 0 
actggttttt agtgcgccag atctctataa tctcgcgcaa cctattttcc cctcgaacac 63 60 
tttttaagcc gtagataaac aggctgggac acttcacatg agcgaaaaat acatcgtcac 6420 
ctgggacatg ttgcagatcc atgcacgtaa actcgcaagc cgactgatgc cttctgaaca 6480 
atggaaaggc attattgccg taagccgtgg cggtctggta ccgggtgcgt tactggcgcg 6540 
tgaactgggt attcgtcatg tcgataccgt ttgtatttcc agctacgatc acgacaacca 6600 
gcgcgagctt aaagtgctga aacgcgcaga aggcgatggc gaaggcttca tcgttattga 6660 
tgacctggtg gataccggtg gtactgcggt tgcgattcgt gaaatgtatc caaaagcgca 672 0 
ctttgtcacc atcttcgcaa aaccggctgg tcgtccgctg gttgatgact atgttgttga 67 80 
tatcccgcaa gatacctgga ttgaacagcc gtgggatatg ggcgtcgtat tcgtcccgcc 6840 
aatctccggt cgctaatctt ttcaacgcct ggcactgccg ggcgttgttc tttttaactt 6900 
caggcgggtt acaatagttt ccagtaagta ttctggaggc tgcatccatg acacaggcaa 6960 
acctgagcga aaccctgttc aaaccccgct ttaaacatcc tgaaacctcg acgctagtcc 7020 
gccgctttaa tcacggcgca caaccgcctg tgcagtcggc ccttgatggt aaaaccatcc 7080 
ctcactggta tcgcatgatt aaccgtctga tgtggatctg gcgcggcatt gacccacgcg 7140 
aaatcctcga cgtccaggca cgtattgtga tgagcgatgc cgaacgtacc gacgatgatt 7200 
tatacgatac ggtgattggc taccgtggcg gcaactggat ttatgagtgg gccccggatc 7260 
tttgtgaagg aaccttactt ctgtggtgtg acataattgg acaaactacc tacagagatt 7320 
taaagctcta aggtaaatat aaaattttta agtgtataat gtgttaaact actgattcta 73 80 
attgtttgtg tattttagat tccaacctat ggaactgatg aatgggagca gtggtggaat 744 0 
gcctttaatg aggaaaacct gttttgctca gaagaaatgc catctagtga tgatgaggct 7500 
actgctgact ctcaacattc tactcctcca aaaaagaaga gaaaggtaga agaccccaag 7560 
gactttcctt cagaattgct aagtttfcttg agtcatgctg tgtttagtaa tagaactctt 762 0 
gcttgctttg ctatttacac cacaaaggaa aaagctgcac tgctatacaa gaaaattatg 7680 
gaaaaatatt ctgtaacctt tataagtagg cataacagtt ataatcataa catactgttt 7740 
tttcttactc cacacaggca tagagtgtct gctattaata actatgctca aaaattgtgt 780 0 
acctttagct ttttaatttg taaaggggtt aataaggaat atttgatgta tagtgccttg 7860 
actagagatc ataatcagcc ataccacatt tgtagaggtt ttacttgctt taaaaaacct 7920 
cccacacctc cccctgaacc tgaaacataa aatgaatgca attgttgttg ttaacttgtt 7980 
tattgcagct tataatggtt acaaataaag caatagcatc acaaatttca caaataaagc 8040 
atttttttca ctgcattcta gttgtggttt gtccaaactc atcaatgtat cttatcatgt 8100 
ctggatcccc aggaagctcc tctgtgtcct cataaaccct aacctcctct acttgagagg 8160 
acattccaat cataggctgc ccatccaccc tctgtgtcct cctgttaatt aggtcactta 8220 
acaaaaagga aattgggtag gggtttttca cagaccgctt tctaagggta attttaaaat 82 80 
atctgggaag tcccttccac tgctgtgttc cagaagtgtt ggtaaacagc ccacaaatgt 8340 
caacagcaga aacatacaag ctgtcagctt tgcacaaggg cccaacaccc tgctcatcaa 8400 
gaagcactgt ggttgctgtg ttagtaatgt gcaaaacagg aggcacattt tccccacctg 8460 
tgtaggttcc aaaatatcta gtgttttcat ttttacttgg atcaggaacc cagcactcca 852 0 
ctggataagc attatcctta tccaaaacag ccttgtggtc agtgttcatc tgctgactgt 8580 
caactgtagc attttttggg gttacagttt gagcaggata tttggtcctg tagtttgcta 8640 
acacaccctg cagctccaaa ggttccccac caacagcaaa aaaatgaaaa tttgaccctt 8700 
gaatgggttt tccagcacca ttttcatgag ttttttgtgt ccctgaatgc aagtttaaca 8760 
tagcagttac cccaataacc tcagttttaa cagtaacagc ttcccacatc aaaatatttc 8 820 
cacaggttaa gtcctcattt aaattaggca aaggaattct tgaagacgaa agggcctcgt 8880 
gatacgccta tttttatagg ttaatgtcat gataataatg gtttcttaga cgtcaggtgg 8940 
cacttttcgg ggaaatgtgc gcggaacccc tatttgttta tttttctaaa tacattcaaa 9000 
tatgtatccg ctcatgagac aataaccctg ataaatgctt caataatatt gaaaaaggaa 9060 
gagtatgagt attcaacatt tccgtgtcgc ccttattccc ttttttgcgg cattttgcct 9120 
tcctgttttt gctcacccag aaacgctggt gaaagtaaaa gatgctgaag atcagttggg 9180 
tgcacgagtg ggttacatcg aactggatct caacagcggt aagatccttg agagttttcg 9240 
ccccgaagaa cgttttccaa tgatgagcac ttttaaagtt ctgctatgtg gcgcggtatt 9300 
atcccgtgtt gacgccgggc aagagcaact cggtcgccgc atacactatt ctcagaatga 9360 
cttggttgag tactcaccag tcacagaaaa gcatcttacg gatggcatga cagtaagaga 9420 
attatgcagt gctgccataa ccatgagtga taacactgcg gccaacttac ttctgacaac 94 80 
gatcggagga ccgaaggagc taaccgcttt tttgcacaac atgggggatc atgtaactcg 9540 
ccttgatcgt tgggaaccgg agctgaatga agccatacca aacgacgagc gtgacaccac 9600 
gatgcctgca gcaatggcaa caacgttgcg caaactatta actggcgaac tacttactct 9660 
agcttcccgg caacaattaa tagactggat ggaggcggat aaagttgcag gaccacttct 9720 
gcgctcggcc cttccggctg gctggtttat tgctgataaa tctggagccg gtgagcgtgg 9780 
gtctcgcggt atcattgcag cactggggcc agatggtaag ccctcccgta tcgtagttat 9840 
ctacacgacg gggagtcagg caactatgga tgaacgaaat agacagatcg ctgagatagg 9900 
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tgcctcactg 
tgatttaaaa 
catgaccaaa 
gatcaaagga 
aaaaccaccg 
gaaggtaact 
gttaggccac 
gttaccagtg 
atagttaccg 
cttggagcga 
cacgcttccc 
agagcgcacg 
tcgccacctc 
gaaaaacgcc 
catgttcttt 
agctgatacc 
ggaagagcgc 
accgcctcag 
tcctccccct 
caatgcgatg 
gtcaaggaag 
gaggctgatc 
atgctcgagc 
gcaaatattt 
gcacttggag 
ggtttagttt 
ccgtgagatt 
gtgccttcag 
tttagtagca 
gatattggag 
ccacattttg 
ggagatggag 
atggcacctg 
gaatcaaatt 
ggtgaaccaa 
aaccccttgg 
aagtaaaggc 
ccctgtccta 
ataaggcgtc 
ccttgtgaat 
gtcacacctg 
gccccgtact 
cccaagctac 
tcggtggtgg 
ggtccttggg 
atggtgagtg 
gaggtaactg 
ggtgggctca 
ccgttgccca 
aaagagagta 
agaaaaggca 
tcttcagacg 
catccgccgt 
caaccccgac 
ctggttagac 
cctcggtggc 
accgcgaaga 
tcggtaccaa 
agtgggttct 
accgcccatt 
ttggtgccaa 
gtcaaaccgc 



attaagcatt 
cttcattttt 
atcccttaac 
tcttcttgag 
ctaccagcgg 
ggcttcagca 
cacttcaaga 
gctgctgcca 
gataaggcgc 
acgacctaca 
gaagggagaa 

agggagcttc 

tgacttgagc 
agcaacgcgg 
cctgcgttat 
gctcgccgca 
ctgatgcggt 
aagccataga 
tgctgtcctg 
caatttcctc 
gcacggggga 
agcgagctct 
ggccgcttct 
cattaatgta 
ttgtgtctcc 
tgtctccgtt 
ttggataagc 
taagatctcc 
ctccattttc 
ccaaactgcc 
ttaagaccaa 
ctggtgtggt 
tgctgtcaaa 
ctaggccatg 
attcaagccc 
cagtgcttag 
ctttgttgta 
gtcttagatt 
tgttttgaga 
caaaacccaa 
gaccagttgc 
ttagtcctag 
cagtggcagt 
tgagggggcc 
tggcaatgct 
tgttgcccgc 
tgaggggtgc 
cagtggttac 
ttttgagcgc 
ccccaggggg 
cagttggagg 
gtcttgcgcg 
ctcaagaccg 
cgccacccgc 
gcctttctcg 
ggagtaccgt 
gtttgtcctc 
gcttgggtct 
ctagttagcc 
tgcgtcaatg 
aacaaactcc 
tatccacgcc 



ggtaactgtc 
aatttaaaag 
gtgagttttc 
atcctttttt 
tggtttgttt 
gagcgcagat 
actctgtagc 
gtggcgataa 
agcggtcggg 
ccgaactgag 
aggcggacag 
cagggggaaa 
gtcgattttt 
cctttttacg 
cccctgattc 
gccgaacgac 
attttctcct 
gcccaccgca 
ccccacccca 
attttattag 
ggggcaaaca 
agcatttagg 
ttattcttgg 
gttgtggcca 
tgtttcctgt 
taagtaaact 
tgataggtta 
atttctaaag 
gtcaaatctt 
tttaacagcc 
agtgagttta 
ccacaaagtt 
actaaggcca 
gccaattttt 
atctcctgca 
gttaacctca 
gttaatatcc 
tagttggttt 
atcaatcctt 
ggctccagta 
tacggtcaaa 
ttttccattt 
agttagaggg 
tgatgtttgc 
aagtttggag 
gaccattaga 
agatatttcc 
attttgggag 
aagcatgcca 
actctcttga 
accggtttcc 
cttcatcttg 
cctactttaa 
tgccgcccgc 
agaggttttc 
tcggaggccg 
aaccgcgagc 
ccctatagtg 
agagagctct 
gggcggagtt 
cattgacgtc 
cattgatgta 



agaccaagtt 
gatctaggtg 
gttccactga 
tctgcgcgta 
gccggatcaa 
accaaatact 
accgcctaca 
gtcgtgtctt 
ctgaacgggg 
atacctacag 
gtatccggta 
cgcctggtat 
gtgatgctcg 
gttcctggcc 
tgtggataac 
cgagcgcagc 
tacgcatctg 
tccccagcat 
ccccccagaa 
gaaaggacag 
acagatggct 
tgacactata 
gcaatgtatg 
gaccagtccc 
gtaccgttta 
tgactgacaa 
ggcataaatc 
ttccaatatt 
ataataagat 
aaaactgaaa 
gcatctttct 
agcttatcat 
gttcctagtt 
gttttgaggg 
ttaatggcta 
agctttttgg 
aagttgtggg 
tgagcatcaa 
agtcctcctg 
actttagttt 
gtgtttaggt 
tgtgtataaa 
ggtgaggcag 
agggctagct 
tcgtgcacgg 
ggtgcggcgg 
aggtttatgt 
gtaaggttgc 
ttggaggtaa 
aacccattgg 
gtgtcatatg 
gatctcaagc 
ttacatcatc 
cacggtgctc 
cgatccggtc 
acgggtttcc 
ccaacagcga 
agtcgtatta 
gcttatatag 
gttacgacat 
aatggggtgg 
ctgccaaaac 



tactcatata 
aagatccttt 
gcgtcagacc 
atctgctgct 
gagctaccaa 
gtccttctag 
tacctcgctc 
accgggttgg 
ggttcgtgca 
cgtgagctat 
agcggcaggg 
ctttatagtc 
tcaggggggc 
ttttgctggc 
cgtattaccg 
gagtcagtga 
tgcggtattt 
gcctgctatt 
tagaatgaca 
tgggagtggc 
ggcaactaga 
gaatagggcc 
aaaaagtgta 
atgaaaatga 
gtgtaatggt 
tgttactttt 
caacagcgtt 
ctgggtccag 
gagcactttg 
ctgtagcaag 
ctgcatttag 
tatttttgtt 
taggaaccat 
gatttgtgtt 
tggctgtagc 
aattgtttga 
ctgagtttat 
acggataact 
ctacattaag 
gcaaggaagt 
cgtctgttac 
tgggctcttt 
tgatagtaag 
ttccttctga 
ttagcggggc 
cagccacagt 
ttgacttggt 
cggcctcgtc 
ctagaggttc 
gggatacaaa 
gatacacggg 
ctgccacacc 
agcagcacct 
agcctacctt 
gatgcggact 
gatccaagag 
gctcgaattc 
atttcgataa 
acctcccacc 
tttggaaagt 
agacttggaa 
cgcatcacca 



tactttagat 
ttgataatct 
ccgtagaaaa 
tgcaaacaaa 
ctctttttcc 
tgtagccgta 
tgctaatcct 
actcaagacg 
cacagcccag 
gagaaagcgc 
tcggaacagg 
ctgtcgggtt 
ggagcctatg 
cttttgctca 
cctttgagtg 
gcgaggaagc 
cacaccgcat 
gtcttcccaa 
cctactcaga 
accttccagg 
aggcacagtc 
ctctagatgc 
agaggatgtg 
catagagtat 
tagtgttaca 
ggcagtttta 
tgtataggct 
gaaggaattg 
aactgttcca 
tatttgactg 
tctacagtta 
tcctactgta 
agccttgttt 
tggtgcatta 
gtcaaacatc 
agctgtaaac 
aaaaagaggg 
aacatcaagt 
ttgcatattg 
attattaata 
atgcaaagga 
caagtcaatg 
ggtactgcta 
cactgtgagg 
ctgtgattgc 
tagggcttct 
ttttttgaga 
cagagagagg 
ggataggcgc 
gggaggagta 
gttgaaggta 
tcacctcgac 
ccgccagaaa 
gcgactgtga 
cgctcaggtc 
tactggaaag 
agate cgagc 
gecagtaage 
gtacacgcct 
cccgttgatt 
atccccgtga 
tggtaatagc 



9960 

10020 

10080 

10140 

10200 

10260 

10320 

10380 

10440 

10500 

10560 

10620 

10680 

10740 

10800 

10860 

10920 

10980 

11040 

11100 

11160 

11220 

11280 

11340 

11400 

11460 

11520 

11580 

11640 

11700 

11760 

11820 

11880 

11940 

12000 

12060 

12120 

12180 

12240 

12300 

12360 

12420 

12480 

12540 

12600 

12660 

12720 

12780 

12840 

12900 

12960 

13020 

13080 

13140 

13200 

13260 

13320 

13380 

13440 

13500 

13560 

13620 
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gatgactaat 
ataatgccag 
atacacttga 
atggaaagtc 

ggggtcgttg 

catatatggg 
tcaatgtcaa 
aagcagattc 
cgcgcactac 
ggcatcagag 
tctatggtgc 
ccgctatcgc 
gcgccctgac 
gggagctgca 



acgtagatgt 
gcgggccatt 
tgtactgcca 
cctattggcg 
ggcggtcagc 
ctatgaacta 
cgcgtatatc 
ttcatgcaat 
tcagcgacct 
cagattgtac 
actctcagta 
tacgtgactg 
gggcttgtct 
tgtgtcagag 



actgccaagt 
taccgtcatt 
agtgggcagt 
ttactatggg 
caggcgggcc 
atgaccccgt 
tggcccgtac 
tgtcggtcaa 
ccaacacaca 
tgagagtcga 
caatctgctc 
ggtcatggct 
gctcccggca 
gttttcaccg 



aggaaagtcc 
gacgtcaata 
ttaccgtaaa 
aacatacgtc 
atttaccgta 
aattgattac 
atcgcgaagc 
gccttgcctt 
agcagggagc 
ccatagggga 
tgatgccgca 
gcgccccgac 
tccgcttaca 
tcatcaccga 



cataaggtca 

gggggcgtac 

tagtccaccc 
attattgacg 
agttatgtaa 
tattaataac 
agcgcaaaac 
gttgtagctt 
agatactggc 
tcgggagatc 
tagttaagcc 
acccgccaac 
gacaagctgt 
aacgcgcgag 



tgtactgggc 
ttggcatatg 
attgacgtca 
tcaatgggcg 
cgcggaactc 
tagtcaataa 
gcctaaccct 
aaattttgct 
ttaactatgc 
tcccgatccg 
agtatacact 
acccgctgac 
gaccgtctcc 
gcagc 



13680 
13740 
13800 
13860 
13920 
13980 
14040 
14100 
14160 
14220 
14280 
14340 
14400 
14455 



<210> 16 

<211> 10610 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 



plasmid 



<400> 16 

gacggatcgg 

tctatgtcat 

tcctcttaca 

cgtgtttatt 

accaccacat 

caacctgcca 

aaaaagcatc 

ctgtcgagcc 

catgtcgctg 

cggcgaagga 

gcggtggtgc 

atacaacatg 

tgtcctccgg 

cagcaccaca 

ggggaccaca 

cctcataaac 

ccggtaccat 

ggccaaaacc 

gagagcccag 

acacaggcac 

atcccaggga 

cacgtaactc 

cagtatggta 

gcgccgagac 

agtcatattt 

ctcgccgctt 

ggcgccccct 

ccaccaccgc 

cgggaggagc 

aaaacctcaa 

tctacagcca 

caaacggccc 

ataaacattc 

atatctctaa 

ccctccacct 

cctgtataag 

gcagggccag 



gagatccgcg 
tttcatggga 
ctttttcata 
tttcaattgc 
agcttataca 
cctccctccc 
atatcatggg 
aaacgctcat 
tccagctgct 
gaagtccacg 
tgcagcagcg 
gcagtggtct 
gcacagcagc 
atattgttca 
gaacccacgt 
acgctggaca 
ataaacctct 
tgcccgccgg 
gactcgtaac 
acgtgcatac 
acaacccatt 
acgttgtgca 
gcgcgggttt 
aaccgagatc 
cctgaagcaa 
agatcgctct 
ggcttcgggt 
agaataagcc 
gggaagagct 
aatgaagatc 
aagaacagat 
tcacgtccaa 
cagcaccttc 
gcaaatcccg 
tcagcctcaa 
attcaaaagc 
ctgaacataa 



cggtacacag 
ctggtctggc 
cattgcccaa 
agaaaatttc 
gatcaccgta 
aacacacaga 
taacagacat 
cagtgatatt 
gagccacagg 
cctacatggg 
cgcgaataaa 
cctcagcgat 
gcaccctgat 
aaatcccaca 
ggccatcata 
taaacattac 
gattaaacat 
ctatacactg 
catggatcat 
acttcctcag 
cctgaatcag 
ttgtcaaagt 
ctgtctcaaa 
gtgttggtcg 
aaccaggtgc 
gtgtagtagt 
tctatgtaaa 
acacccagcc 
ggaagaacca 
tattaagtga 
aatggcattt 
gtggacgtaa 
aaccatgccc 
aatattaagt 
gcagcgaatc 
ggaacattaa 
tcgtgcaggt 



aattcaggag 
cacaactaca 
gaataaagaa 
aagtcatttt 
ccttaatcaa 
gtacacagtc 
attcttaggt 
aataaactcc 
ctgctgtcca 
ggtagagtca 
ctgctgccgc 
gattcgcacc 
ctcacttaaa 
gtgcaaggcg 
ccacaagcgc 
ctcttttggc 
ggcgccatcc 
cagggaaccg 
catgctcgtc 
gattacaagc 
cgtaaatccc 
gttacattcg 
aggaggtaga 
tagtgtcatg 
gggcgtgaca 
tgtagtatat 
ctccttcatg 
aacctacaca 
tgtttttttt 
acgcgctccc 
gtaagatgtt 
aggctaaacc 
aaataattct 
ccggccattg 
atgattgcaa 
caaaaatacc 
ctgcacggac 



acacaactcc 
ttaatgaaat 
tcgtttgtgt 
tcattcagta 
actcacagaa 
ctttctcccc 
gttatattcc 
ccgggcagct 
acttgcggtt 
taatcgtgca 
cgccgctccg 
gcccgcagca 
tcagcacagt 
ctgtatccaa 
aggtagatta 
atgttgtaat 
accaccatcc 
ggactggaac 
atgatatcaa 
tcctcccgcg 
acactgcagg 
ggcagcagcg 
cgatccctac 
ccaaatggaa 
aacagatctg 
ccactctctc 
cgccgctgcc 
ttcgttctgc 
tttattccaa 
ctccggtggc 
gcacaatggc 
cttcagggtg 
catctcgcca 
taaaaatctg 
aaattcaggt 
gcgatcccgt 
cagcgcggcc 



aagtgcatac 
atttgccaca 
tatgtttcaa 
gtatagcccc 
ccctagtatt 
ggctggcctt 
acacggtttc 
cacttaagtt 
gcttaacggg 
tcaggatagg 
tcctgcagga 
taaggcgcct 
aactgcagca 
agctcatggc 
agtggcgacc 
tcaccacctc 
taaaccagct 
aatgacagtg 
tgttggcaca 
ttagaaccat 
gaagacctcg 
gatgatcctc 
tgtacggagt 
cgccggacgt 
cgtctccggt 
aaagcatcca 
ctgataacat 
gagtcacaca 
aagattatcc 
gtggtcaaac 
ttccaaaagg 
aatctcctct 
ccttctcaat 
ctccagagcg 
tcctcacaga 
aggtcccttc 
acttccccgc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 
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caggaacctt 
ccagcgtagc 
ctcaaaaaat 
agataaaggc 
atgtctgcgg 
aagcctgtct 
cggcgtgacc 
tcatgtccgg 
gctaaaaagc 
acagccccca 
aaaccctcct 
cagcggcagc 
ctcgacacgg 
atatatagga 
cacgcgaacc 
ttccgttttc 
tcagtacaat 
tggaggtcgc 
acaattgcat 
ccagatatac 
cattagttca 
ctggctgacc 
taacgccaat 
acttggcagt 
gtaaatggcc 
agtacatcta 
atgggcgtgg 
atgggagttt 
ccccattgac 
tctggctaac 
gggagaccca 
gttgaggaca 
cgaacggtac 
tctcgagaaa 
gcagcgggtg 
aggcggtctt 
aagcgcgcaa 
ggtcctccaa 
agtccccctg 
cttgcgctca 
aatgtaacca 
tctgcacccc 
gtcgcgggca 
cttagcattg 
tcaggccccc 
actactgcca 
aaactaggac 
gtagcaactg 
gccttgggtt 
gattctcaaa 
ctaaatctaa 
aactacaaca 
aacctaagca 
gatgggcttg 
ggccatggcc 
agttttgaca 
tggaccacac 
actttggtct 
aaaggcagtt 
gacgaaaatg 
aatggagatc 
tcagcttatc 



gacaaaagaa 
cccgatgtaa 
caggcaaagc 
aggtaagctc 
gtttctgcat 
tacaacagga 
gtaaaaaaac 
agtcataatg 
gaccgaaata 
taggaggtat 
gcctaggcaa 
ctaacagtca 
caccagctca 
ctaaaaaatg 
tacgcccaga 
ccacgttacg 
ctgctctgat 
tgagtagtgc 
gaagaatctg 
gcgttgacat 
tagcccatat 
gcccaacgac 
agggactttc 
acatcaagtg 
cgcctggcat 
cgtattagtc 
atagcggttt 
gttttggcac 
gcaaatgggc 
tagagaaccc 
agcttggtac 
aactcttcgc 
tccgccaccg 
ggcgtctaac 
gcggtcgggg 
gagacggcgg 
gaccgtctga 
ctgtgccttt 
gggtactctc 
aaatgggcaa 
ctgtgagccc 
tcacagttac 
acacactcac 
ccacccaagg 
tcaccaccac 
ctggtagctt 
taaagtacgg 
gtccaggtgt 
ttgattcaca 
acagacgcct 
gactaggaca 
aaggccttta 
ctgccaaggg 
aatttggttc 
tagaatttga 
gcacaggtgc 
cagctccatc 
taacaaaatg 
tggctccaat 
gagtgctact 
ttactgaagg 
caaaatctca 



cccacactga 
gctttgttgc 
ctcgcgcaaa 
cggaaccacc 
aaacacaaaa 
aaaacaaccc 
tggtcaccgt 
taagactcgg 
gcccggggga 
aacaaaatta 
aatagcaccc 
gccttaccag 
atcagtcaca 
acgtaacggt 
aacgaaagcc 
taacttcccg 
gccgcatagt 
gcgagcaaaa 
cttagggtta 
tgattattga 
atggagttcc 
ccccgcccat 
cattgacgtc 
tatcatatgc 
tatgcccagt 
atcgctatta 
gactcacggg 
caaaatcaac 
ggtaggcgtg 
actgcttact 
cgagctcgga 
ggtctttcca 
agggacctga 
cagtcacagt 
ttgtttctgg 
atggtcgagg 
agataccttc 
tcttactcct 
tttgcgccta 
cggcctctct 
acctctcaaa 
ctcagaagcc 
catgcaatca 
acccctcaca 
cgatagcagt 
gggcattgac 
ggctcctttg 
gactattaat 
aggcaatatg 
tatacttgat 
gggccctctt 
cttgtttaca 
gttgatgttt 
acctaatgca 
ttcaaacaag 
cattacagta 
tcctaactgt 
tggcagtcaa 
atctggaaca 
aaacaattcc 
cacagcctat 
cggtaaaact 



ttatgacacg 

at 999 c 99 c 9 
aaagaaagca 
acagaaaaag 
taaaataaca 
ttataagcat 
gattaaaaag 
taaacacatc 
atacataccc 
ataggagaga 
tcccgctcca 
taaaaaagaa 
gtgtaaaaaa 
taaagtccac 
aaaaaaccca 
gatcctctcc 
taagccagta 
tttaagctac 
ggcgttttgc 
ctagttatta 
gcgttacata 
tgacgtcaat 
aatgggtgga 
caagtacgcc 
acatgacctt 
ccatggtgat 
gatttccaag 
gggactttcc 
tacggtggga 
ggcttatcga 
tctgaattcg 
gtactcttgg 
gcgagtccgc 
cgcaaggtag 
cggaggtgct 
tgaggtgtgg 
aaccccgtgt 
ccctttgtat 
tccgaacctc 
ctggacgagg 
aaaaccaagt 
ctaactgtgg 
caggccccgc 
gtgtcagaag 
acccttacta 
ttgaaagagc 
catgtaacag 
aatacttcct 
caacttaatg 
gttagttatc 
tttataaact 
gcttcaaaca 
gacgctacag 
ccaaacacaa 
gctatggttc 
ggaaacaaaa 
agactaaatg 
atacttgcta 
gttcaaagtg 
ttcctggacc 
acaaacgctg 
gccaaaagta 



catactcgga 
atataaaatg 
catcgtagtc 
acaccatttt 
aaaaaacatt 
aagacggact 
caccaccgac 
aggttgattc 
gcaggcgtag 
aaaacacata 
gaacaacata 
aacctattaa 
gggccaagtg 
aaaaaacacc 
caacttcctc 
cgatccccta 
tctgctccct 
aacaaggcaa 
gctgcttcgc 
atagtaatca 
acttacggta 
aatgacgtat 
ctatttacgg 
ccctattgac 
atgggacttt 
gcggttttgg 
tctccacccc 
aaaatgtcgt 
ggtctatata 
aattaatacg 
agctcgctgt 
atcggaaacc 
atcgaccgga 
gctgagcacc 
gctgatgatg 
caggcttgag 
atccatatga 
cccccaatgg 
tagttacctc 
ccggcaacct 
caaacataaa 
ctgccgccgc 
taaccgtgca 
gaaagctagc 
tcactgcctc 
ccatttatac 
acgacctaaa 
tgcaaactaa 
tagcaggagg 
cgtttgatgc 
cagcccacaa 
attccaaaaa 
ccatagccat 
atcccctcaa 
ctaaactagg 
ataatgataa 
cagagaaaga 
cagtttcagt 
ctcatcttat 
cagaatattg 
ttggatttat 
acattgtcag 



gctatgctaa 
caaggtgctg 
atgctcatgc 
tctctcaaac 
taaacattag 
acggccatgc 
agctcctcgg 
atcggtcagt 
agacaacatt 
aacacctgaa 
cagcgcttca 
aaaaacacca 
cagagcgagt 
cagaaaaccg 
aaatcgtcac 
tggtcgactc 
gcttgtgtgt 
ggcttgaccg 
gatgtacggg 
attacggggt 
aatggcccgc 
gttcccatag 
taaactgccc 
gtcaatgacg 
cctacttggc 
cagtacatca 
attgacgtca 
aacaactccg 
agcagagctc 
actcactata 
tgggctcgcg 
cgtcggcctc 
tcggaaaacc 
gtggcgggcg 
taattaaagt 
atccaagatg 
cacggaaacc 
gtttcaagag 
caatggcatg 
tacctcccaa 
cctggaaata 
acctctaatg 
cgactccaaa 
cctgcaaaca 
accccctcta 
acaaaatgga 
cactttgacc 
agttactgga 
actaaggatt 
tcaaaaccaa 
cttggatatt 
gcttgaggtt 
taatgcagga 
aacaaaaatt 
aactggcctt 
gctaactttg 
tgctaaactc 
tttggctgtt 
tataagattt 
gaactttaga 
gcctaaccta 
tcaagtttac 



2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 



WO 01/83729 PCT/EP01/04863 

19 



ttaaacggag acaaaactaa acctgtaaca ctaaccatta cactaaacgg tacacaggaa 60 00 
acaggagaca caactccaag tgcatactct atgtcatttt catgggactg gtctggccac 6060 
aactacatta atgaaatatt tgccacatcc tcttacactt tttcatacat tgcccaagaa 6120 
taaagaagcg gccgctcgag catgcatcta gagggcccta ttctatagtg tcacctaaat 6180 
gctagagctc gctgatcagc ctcgactgtg ccttctagtt gccagccatc tgttgtttgc 6240 
ccctcccccg tgccttcctt gaccctggaa ggtgccactc ccactgtcct ttcctaataa 6300 
aatgaggaaa ttgcatcgca ttgtctgagt aggtgtcatt ctattctggg gggtggggtg 63 60 
gggcaggaca gcaaggggga ggattgggaa gacaatagca ggcatgctgg ggatgcggtg 6420 
ggctctatgg cttctgaggc ggaaagaacc agctggggct ctagggggta tccccacgcg 64 80 
ccctgtagcg gcgcattaag cgcggcgggt gtggtggtta cgcgcagcgt gaccgctaca 6540 
cttgccagcg ccctagcgcc cgctcctttc gctttcttcc cttcctttct cgccacgttc 6600 
gccggctttc cccgtcaagc tctaaatcgg ggcatccctt tagggttccg atttagtgct 6660 
ttacggcacc tcgaccccaa aaaacttgat tagggtgatg gttcacgtag tgggccatcg 6720 
ccctgataga cggtttttcg ccctttgacg ttggagtcca cgttctttaa tagtggactc 67 80 
ttgttccaaa ctggaacaac actcaaccct atctcggtct attcttttga tttataaggg 6840 
attttgggga tttcggccta ttggttaaaa aatgagctga tttaacaaaa atttaacgcg 6900 
aattaattct gtggaatgtg tgtcagttag ggtgtggaaa gtccccaggc tccccaggca 6960 
ggcagaagta tgcaaagcat gcatctcaat tagtcagcaa ccaggtgtgg aaagtcccca 7020 
ggctccccag caggcagaag tatgcaaagc atgcatctca attagtcagc aaccatagtc 70 80 
ccgcccctaa ctccgcccat cccgccccta actccgccca gttccgccca ttctccgccc 7140 
catggctgac taattttttt tatttatgca gaggccgagg ccgcctctgc ctctgagcta 7200 
ttccagaagt agtgaggagg cttttttgga ggcctaggct tttgcaaaaa gctcccggga 7260 
gcttgtatat ccattttcgg atctgatcaa gagacaggat gaggatcgtt tcgcatgatt 732 0 
gaacaagatg gattgcacgc aggttctccg gccgcttggg tggagaggct attcggctat 7380 
gactgggcac aacagacaat cggctgctct gatgccgccg tgttccggct gtcagcgcag 7440 
gggcgcccgg ttctttttgt caagaccgac ctgtccggtg ccctgaatga actgcaggac 7500 
gaggcagcgc ggctatcgtg gctggccacg acgggcgttc cttgcgcagc tgtgctcgac 7560 
gttgtcactg aagcgggaag ggactggctg ctattgggcg aagtgccggg gcaggatctc 762 0 
ctgtcatctc accttgctcc tgccgagaaa gtatccatca tggctgatgc aatgcggcgg 7680 
ctgcatacgc ttgatccggc tacctgccca ttcgaccacc aagcgaaaca tcgcatcgag 774 0 
cgagcacgta ctcggatgga agccggtctt gtcgatcagg atgatctgga cgaagagcat 7800 
caggggctcg cgccagccga actgttcgcc aggctcaagg cgcgcatgcc cgacggcgag 7860 
gatctcgtcg tgacccatgg cgatgcctgc ttgccgaata tcatggtgga aaatggccgc 7 920 
ttttctggat tcatcgactg tggccggctg ggtgtggcgg accgctatca ggacatagcg 7980 
ttggctaccc gtgatattgc tgaagagctt ggcggcgaat gggctgaccg cttcctcgtg 8040 
ctttacggta tcgccgctcc cgattcgcag cgcatcgcct tctatcgcct tcttgacgag 8100 
ttcttctgag cgggactctg gggttcgaaa tgaccgacca agcgacgccc aacctgccat 8160 
cacgagattt cgattccacc gccgccttct atgaaaggtt gggcttcgga atcgttttcc 8220 
gggacgccgg ctggatgatc ctccagcgcg gggatctcat gctggagttc ttcgcccacc 8280 
ccaacttgtt tattgcagct tataatggtt acaaataaag caatagcatc acaaatttca 8340 
caaataaagc atttttttca ctgcattcta gttgtggttt gtccaaactc atcaatgtat 8400 
cttatcatgt ctgtataccg tcgacctcta gctagagctt ggcgtaatca tggtcatagc 8460 
tgtttcctgt gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca 8520 
taaagtgtaa agcctggggt gcctaatgag tgagctaact cacattaatt gcgttgcgct 8580 
cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac 8 640 
gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc 8700 
tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt 8 760 
tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg 8 82 0 
ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg 8 880 
agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat 8940 
accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta 9000 
ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcaa tgctcacgct 9060 
gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc 9120 
ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa 9180 
gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg. 9240 
taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact agaaggacag 9300 
tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt 9360 
gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta 9420 
cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc 9480 
agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca 9540 
cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa 9600 
cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat 9660 
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ttcgttcatc 
taccatctgg 
tatcagcaat 
ccgcctccat 
atagtttgcg 
gtatggcttc 
tgtgcaaaaa 
cagtgttatc 
taagatgctt 
ggcgaccgag 
ctttaaaagt 
cgctgttgag 
ttactttcac 
gaataagggc 
gcatttatca 
aacaaatagg 



catagttgcc 
ccccagtgct 
aaaccagcca 
ccagtctatt 
caacgttgtt 
attcagctcc 
agcggttagc 
actcatggtt 
ttctgtgact 
ttgctcttgc 
gctcatcatt 
atccagttcg 
cagcgtttct 
gacacggaaa 
gggttattgt 
ggttccgcgc 



tgactccccg 
gcaatgatac 
gccggaaggg 
aattgttgcc 
gccattgcta 
ggttcccaac 
tccttcggtc 
atggcagcac 
ggtgagtact 
ccggcgtcaa 
ggaaaacgtt 
atgtaaccca 
gggtgagcaa 
tgttgaatac 
ctcatgagcg 
acatttcccc 



tcgtgtagat 
cgcgagaccc 
ccgagcgcag 
gggaagctag 
caggcatcgt 
gatcaaggcg 
ctccgatcgt 
tgcataattc 
caaccaagtc 
tacgggataa 
cttcggggcg 
ctcgtgcacc 
aaacaggaag 
tcatactctt 
gatacatatt 
gaaaagtgcc 



aactacgata 
acgctcaccg 
aagtggtcct 
agtaagtagt 
ggtgtcacgc 
agttacatga 
tgtcagaagt 
tcttactgtc 
attctgagaa 
taccgcgcca 
aaaactctca 
caactgatct 
gcaaaatgcc 
cctttttcaa 
tgaatgtatt 
acctgacgtc 



cgggagggct 
gctccagatt 
gcaactttat 
tcgccagtta 
tcgtcgtttg 
tcccccatgt 
aagttggccg 
atgccatccg 
tagtgtatgc 
catagcagaa 
aggatcttac 
tcagcatctt 
gcaaaaaagg 
tattattgaa 
tagaaaaata 



9720 

9780 

9840 

9900 

9960 

10020 

10080 

10140 

10200 

10260 

10320 

10380 

10440 

10500 

10560 

10610 



<210> 17 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 17 

tgtacaccgg atccggcgca cacc 24 



<210> 18 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 18 

cacaacgagc tcaattaatt aattgccaca tcctc 35 

<210> 19 

<211> 4 

<212> PRT 

<213> adenovirus 

<400> 19 
Thr Leu Trp Thr 
1 

<210> 20 

<211> 12 

<212> PRT 

<213> adenovirus 

<400> 20 

Pro Ser Ala Ser Ala Ser Ala Ser Ala Pro Gly Ser 
15 10 

<210> 22 
<211> 327 
<212> DNA 
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<213> adenovirus 



<400> 22 

agatctgaat 

ccagtactct 

tgagcgagtc 

agtcgcaagg 

tggcggaggt 

aggtgaggtg 



tcgagctcgc 
tggatcggaa 
cgcatcgacc 
taggctgagc 
gctgctgatg 
tggcaggctt 



<210> 23 
<211> 32480 
<212> DNA 
<213> adenovirus 



tgttgggctc 
acccgtcggc 
ggatcggaaa 
accgtggcgg 
atgtaattaa 
gagatct 



gcggttgagg 
ctccgaacgg 
acctctcgag 
gcggcagcgg 
agtaggcggt 



acaaactctt 
tactccgcca 
aaaggcgtct 
gtggcggtcg 
cttgagacgg 



cgcggtcttt 60 
ccgagggacc 12 0 
aaccagtcac 180 
gggttgtttc 240 
cggatggtcg 3 00 
327 



<400> 23 

catcatcaat 

ttgtgacgtg 

gatgttgcaa 

gtgtgcgccg 

taaatttggg 

agtgaaatct 

aagcttgaat 

tatgcttccg 

cagctatgac 

aggaacttgg 

agggtgtgga 

ttagtcagca 

catgcatctc 

aactccgccc 

agaggccgag 

aggcctaggc 

tcgaacactt 

atcgtcacct 

tctgaacaat 

gaccagaaac 

ggcgagatcg 

cttaatcgcc 

accgatcgcc 

ccggcaccag 

gtcgtcgtcc 

acctatccca 

tcgctcacat 

gatggcgtta 

gacagtcgtt 

ctcgcggtga 

cggatgagcg 

gatttccatg 

gttcagatgt 

gaaacgcagg 

ggttatgccg 

gaaatcccga 

gaagcagaag 

ctgaacggca 

ggtcaggtca 

tttaacgccg 

cgctacggcc 

aatcgtctga 

gtgcagcgcg 

cacggcgcta 

gtgcagtatg 

gcgcgcgtgg 



aatatacctt 
gcgcggggcg 
gtgtggcgga 
gtgtacacag 
cgtaaccgag 
gaataatttt 
tcgattaatg 
gctcgtatgt 
catgattacg 
ttaggtacct 
aagtccccag 
accaggtgtg 
aattagtcag 
agttccgccc 
gccgcctcgg 
ttttgcaaaa 
tttaagccgt 

gggacatgtt 

ggaaaggcat 
agcacctcga 
atcccgtcgt 
ttgcagcaca 
cttcccaaca 
aagcggtgcc 
cctcaaactg 
ttacggtcaa 
ttaatgttga 
actcggcgtt 
tgccgtctga 
tggtgctgcg 
gcattttccg 
ttgccactcg 
gcggcgagtt 
tcgccagcgg 
atcgcgtcac 
atctctatcg 
cctgcgatgt 
agccgttgct 
tggatgagca 
tgcgctgttc 
tgtatgtggt 
ccgatgatcc 
atcgtaatca 
atcacgacgc 
aaggcggcgg 
atgaagacca 



attttggatt 
tgggaacggg 
acacatgtaa 
gaagtgacaa 
taagatttgg 
gtgttactca 
tgagttagct 
tgtgtggaat 
aattcggcgc 
tctgaggcgg 
gctccccagc 
gaaagtcccc 
caaccatagt 
attctccgcc 
cctctgagct 
agcttgggat 
agataaacag 
gcagatccat 
tattgccgta 
actgagccgc 
tttacaacgt 
tccccctttc 
gttgcgcagc 
ggaaagctgg 
gcagatgcac 
tccgccgttt 
tgaaagctgg 
tcatctgtgg 
atttgacctg 
ttggagtgac 
tgacgtctcg 
ctttaatgat 
gcgtgactac 
caccgcgcct 
actacgtctg 
tgcggtggtt 
cggtttccgc 
gattcgaggc 
gacgatggtg 
gcattatccg 
ggatgaagcc 
gcgctggcta 
cccgagtgtg 
gctgtatcgc 
agccgacacc 
gcccttcccg 



gaagccaata 
gcgggtgacg 
gcgacggatg 
ttttcgcgcg 
ccattttcgc 
tagcgcgtaa 
cactcattag 
tgtgagcgga 
agcaccatgg 
aaagaaccag 
aggcagaagt 
aggctcccca 
cccgccccta 
ccatggctga 
attccagaag 
ctctataatc 
gctgggacac 
gcacgtaaac 
agccgtggcg 
gatattgccc 
cgtgactggg 
gccagctggc 
ctgaatggcg 
ctggagtgcg 
ggttacgatg 
gttcccacgg 
ctacaggaag 
tgcaacgggc 
agcgcatttt 
ggcagttatc 
ttgctgcata 
gatttcagcc 
ctacgggtaa 
ttcggcggtg 
aacgtcgaaa 
gaactgcaca 
gaggtgcgga 
gttaaccgtc 
caggatatcc 
aaccatccgc 
aatattgaaa 
ccggcgatga 
atcatctggt 
tggatcaaat 
acggccaccg 
gctgtgccga 



tgataatgag 
tagtagtgtg 
tggcaaaagt 
gttttaggcg 
gggaaaactg 
tctctagcat 
gcaccccagg 
taacaatttc 
cctgaaataa 
ctgtggaatg 
atgcaaagca 
gcaggcagaa 
actccgccca 
ctaatttttt 
tagtgaggag 
tcgcgcaacc 
ttcacatgag 
tcgcaagccg 
gtctggtacc 
agcgtttcaa 
aaaaccctgg 
gtaatagcga 
aatggcgctt 
atcttcctga 
cgcccatcta 
agaatccgac 
gccagacgcg 
gctgggtcgg 
tacgcgccgg 
tggaagatca 
aaccgactac 
gcgctgtact 
cagtttcttt 
aaattatcga 
acccgaaact 
ccgccgacgg 
ttgaaaatgg 
acgagcatca 
tgctgatgaa 
tgtggtacac 
cccacggcat 
gcgaacgcgt 
cgctggggaa 
ctgtcgatcc 
atattatttg 
aatggtccat 



ggggtggagt 
gcggaagtgt 
gacgtttttg 
gatgttgtag 
aataagagga 
cgatgtcgac 
ctttacactt 
acacaggaaa 
cctctgaaag 
tgtgtcagtt 
tgcatctcaa 
gtatgcaaag 
tcccgcccct 
ttatttatgc 
gcttttttgg 
tattttcccc 
cgaaaaatac 
actgatgcct 
ggtgggtgaa 
cgcgctgtat 
cgttacccaa 
agaggcccgc 
tgcctggttt 
ggccgatact 
caccaacgta 
gggttgttac 
aattattttt 
ttacggccag 
agaaaaccgc 
ggatatgtgg 
acaaatcagc 
ggaggctgaa 
atggcagggt 
tgagcgtggt 
gtggagcgcc 
cacgctgatt 
tctgctgctg 
tcctctgcat 
gcagaacaac 
gctgtgcgac 
ggtgccaatg 
aacgcgaatg 
tgaatcaggc 
ttcccgcccg 
cccgatgtac 
caaaaaatgg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 
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ctttcgctac ctggagagac gcgcccgctg 
aacagtcttg gcggtttcgc taaatactgg 
ggcggcttcg tctgggactg ggtggatcag 
ccgtggtcgg cttacggcgg tgattttggc 
aacggtctgg tctttgccga ccgcacgccg 
cagcagtttt tccagttccg tttatccggg 
fctccgtcata gcgataacga gctcctgcac 
gcaagcggtg aagtgcctct ggatgtcgct 
gaactaccgc agccggagag cgccgggcaa 
aacgcgaccg catggtcaga agccgggcac 
gaaaacctca gtgtgacgct ccccgccgcg 
gaaatggatt tttgcatcga gctgggtaat 
tttctttcac agatgtggat tggcgataaa 
ttcacccgtg caccgctgga taacgacatt 
aacgcctggg tcgaacgctg gaaggcggcg 
cagtgcacgg cagatacact tgctgatgcg 
catcagggga aaaccttatt tatcagccgg 
atggcgatta ccgttgatgt tgaagtggcg 
ctgaactgcc agctggcgca ggtagcagag 
gaaaactatc ccgaccgcct tactgccgcc 
gacatgtata ccccgtacgt cttcccgagc 
ttgaattatg gcccacacca gtggcgcggc 
caacagcaac tgatggaaac cagccatcgc 
ctgaatatcg acggtttcca tatggggatt 
tcggcggaat tccagctgag cgccggtcgc 
taataataac cgggcaggcc atgtctgccc 
tatttaaaaa acacaaactt ttggatgttc 
atgggagcct acttcccgtt tttcccgatt 
agtgatacgg gtattatttt tgccgctatt 
tttggtctgc tttctgacaa actcggaact 
aaagcaatag catcacaaat tt.cacaaata 
gtttgtccaa actcatcaat gtatcttatc 
agggtgggaa agaatatata aggtgggggt 
gccgccgccg ccatgagcac caactcgttt 
acgcgcatgc ccccatgggc cggggtgcgt 
cgccccgtcc tgcccgcaaa ctctactacc 
ttggagactg cagcctccgc cgccgcttca 
actgactttg ctttcctgag cccgcttgca 
gatgacaagt tgacggctct tttggcacaa 
gtttctcagc agctgttgga tctgcgccag 
cccaatgcgg tttaaaacat aaataaaaaa 
gtgtcttgct gtctttattt aggggttttg 
cggtcgttga gggtcctgtg tattttttcc 
agatacatgg gcataagccc gtctctgggg 
tgcggggtgg tgttgtagat gatccagtcg 
atgtctttca gtagcaagct gattgccagg 
cggttaagct gggatgggtg catacgtggg 
aggttggcta tgttcccagc catatccctc 
acagtgtatc cggtgcactt gggaaatttg 
aacttggaga cgcccttgtg acctccaaga 
atgggcccac gggcggcggc ctgggcgaag 
tgttccagga tgagatcgtc ataggccatt 
tgcggtataa tggttccatc cggcccaggg 
cacgctttga gttcagatgg ggggatcatg 
tccggggtag gggagatcag ctgggaagaa 
cagccggtgg gcccgtaaat cacacctatt 
cagctgccgt catccctgag caggggggcc 
ttttccctga ccaaatccgc cagaaggcgc 
gaagcaaagt ttttcaacgg tttgagaccg 
ccaagcagtt ccaggcggtc ccacagctcg 
atatctcctc gtttcgcggg ttggggcggc 
ccagacgggc cagggtcatg tctttccacg 



atcctttgcg aatacgccca cgcgatgggt 282 0 
caggcgtttc gtcagtatcc ccgtttacag 2 88 0 
tcgctgatta aatatgatga aaacggcaac 2940 
gatacgccga acgatcgcca gttctgtatg 3 000 
catccagcgc tgacggaagc aaaacaccag 3 060 
caaaccatcg aagtgaccag cgaatacctg 312 0 
tggatggtgg cgctggatgg taagccgctg 3180 
ccacaaggta aacagttgat tgaactgcct 3240 
ctctggctca cagtacgcgt agtgcaaccg 330 0 
atcagcgcct ggcagcagtg gcgtctggcg 3 36 0 
tcccacgcca tcccgcatct gaccaccagc 342 0 
aagcgttggc aatttaaccg ccagtcaggc 3480 
aaacaactgc tgacgccgct gcgcgatcag 3540 
ggcgtaagtg aagcgacccg cattgaccct 3 600 
ggccattacc aggccgaagc agcgttgttg 3660 
gtgctgatta cgaccgctca cgcgtggcag 3 72 0 
aaaacctacc ggattgatgg tagtggtcaa 3 78 0 
agcgatacac cgcatccggc gcggattggc 3 84 0 
cgggtaaact ggctcggatt agggccgcaa 3900 
tgttttgacc gctgggatct gccattgtca 3 960 
gaaaacggtc tgcgctgcgg gacgcgcgaa 4 02 0 
gacttccagt tcaacatcag ccgctacagt 4080 
catctgctgc acgcggaaga aggcacatgg 4140 
ggtggcgacg actcctggag cccgtcagta 4200 
taccattacc agttggtctg gtgtcaaaaa 4260 
gtatttcgcg taaggaaatc cattatgtac 432 0 
ggtttattct ttttctttta cttttttatc 4380 
tggctacatg acatcaacca tatcagcaaa 4440 
tctctgttct cgctattatt ccaaccgctg 4500 
tgtttattgc agcttataat ggttacaaat 456 0 
aagcattttt ttcactgcat tctagttgtg 4 62 0 
atgtctggat ccagatctgg gcgtggctta 4680 
cttatgtagt tttgtatctg ttttgcagca 4740 
gatggaagca ttgtgagctc atatttgaca 4 80 0 
cagaatgtga tgggctccag cattgatggt 4 860 
ttgacctacg agaccgtgtc tggaacgccg 4 920 
gccgctgcag ccaccgcccg cgggattgtg 4 980 
agcagtgcag cttcccgttc atccgcccgc 5040 
ttggattctt tgacccggga acttaatgtc 5100 
caggtttctg ccctgaaggc ttcctcccct 5160 
ccagactctg tttggatttg gatcaagcaa 522 0 
cgcgcgcggt aggcccggga ccagcggtct 5280 
aggacgtggt aaaggtgact ctggatgttc 5340 
tggaggtagc accactgcag agcttcatgc 540 0 
tagcaggagc gctgggcgtg gtgcctaaaa 5460 
ggcaggccct tggtgtaagt gtttacaaag 5520 
gatatgagat gcatcttgga ctgtattttt 5580 
cggggattca tgttgtgcag aaccaccagc 5640 
tcatgtagct tagaaggaaa tgcgtggaag 5700 
ttttccatgc attcgtccat aatgatggca 5760 
atatttctgg gatcactaac gtcatagttg 5820 
tttacaaagc gcgggcggag ggtgccagac 5880 
gcgtagttac cctcacagat ttgcatttcc 5940 
tctacctgcg gggcgatgaa gaaaacggtt 6000 
agcaggttcc tgagcagctg cgacttaccg 6060 
accgggtgca actggtagtt aagagagctg 6120 
acttcgttaa gcatgtccct gactcgcatg 6180 
tcgccgccca gcgatagcag ttcttgcaag 6240 
tccgccgtag gcatgctttt gagcgtttga 6300 
gtcacctgct ctacggcatc tcgatccagc 6360 
tttcgctgta cggcagtagt cggtgctcgt 6420 
ggcgcagggt cctcgtcagc gtagtctggg 6480 
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tcacggtgaa ggggtgcgct ccgggctgcg cgctggccag ggtgcgcttg aggctggtcc 6540 
tgctggtgct gaagcgctgc cggtcttcgc cctgcgcgtc ggccaggtag catttgacca 6600 
tggtgtcata gtccagcccc tccgcggcgt ggcccttggc gcgcagcttg cccttggagg 6660 
aggcgccgca cgaggggcag tgcagacttt tgagggcgta gagcttgggc gcgagaaata 672 0 
ccgattccgg ggagtaggca tccgcgccgc aggccccgca gacggtctcg cattccacga 6780 
gccaggtgag ctctggccgt tcggggtcaa aaaccaggtt tcccccatgc tttttgatgc 6840 
gtttcttacc tctggtttcc atgagccggt gtccacgctc ggtgacgaaa aggctgtccg 6900 
tgtccccgta tacagacttg agaggcctgt cctcgagcgg tgttccgcgg tcctcctcgt 6960 
atagaaactc ggaccactct gagacaaagg ctcgcgtcca ggccagcacg aaggaggcta 7020 
agtgggaggg gtagcggtcg ttgtccacta gggggtccac tcgctccagg gtgtgaagac 7080 
acatgtcgcc ctcttcggca tcaaggaagg tgattggttt gtaggtgtag gccacgtgac 7140 
cgggtgttcc tgaagggggg ctataaaagg gggtgggggc gcgttcgtcc tcactctctt 72 0 0 
ccgcatcgct gtctgcgagg gccagctgtt ggggtgagta ctccctctga aaagcgggca 7260 
tgacttctgc gctaagattg tcagtttcca aaaacgagga ggatttgata ttcacctggc 7320 
ccgcggtgat gcctttgagg gtggccgcat ccatctggtc agaaaagaca atctttttgt 7380 
tgtcaagctt ggtggcaaac gacccgtaga gggcgttgga cagcaacttg gcgatggagc 744 0 
gcagggtttg gtttttgtcg cgatcggcgc gctccttggc cgcgatgttt agctgcacgt 7500 
attcgcgcgc aacgcaccgc cattcgggaa agacggtggt gcgctcgtcg ggcaccaggt 7560 
gcacgcgcca accgcggttg tgcagggtga caaggtcaac gctggtggct acctctccgc 762 0 
gtaggcgctc gttggtccag cagaggcggc cgcccttgcg cgagcagaat ggcggtaggg 7680 
ggtctagctg cgtctcgtcc ggggggtctg cgtccacggt aaagaccccg ggcagcaggc 774 0 
gcgcgtcgaa gtagtctatc ttgcatcctt gcaagtctag cgcctgctgc catgcgcggg 7 8 00 
cggcaagcgc gcgctcgtat gggttgagtg gggga cccca. tggcatgggg tgggtgagcg 7860 
cggaggcgta catgccgcaa atgtcgtaaa cgtagagggg ctctctgagt attccaagat 792 0 
atgtagggta gcatcttcca ccgcggatgc tggcgcgcac gtaatcgtat agttcgtgcg 7980 
agggagcgag gaggtcggga ccgaggttgc tacgggcggg ctgctctgct cggaagacta 8040 
tctgcctgaa gatggcatgt gagttggatg atatggttgg acgctggaag acgttgaagc 8100 
tggcgtctgt gagacctacc gcgtcacgca cgaaggaggc gtaggagtcg cgcagcttgt 8160 
tgaccagctc ggcggtgacc tgcacgtcta gggcgcagta gtccagggtt tccttgatga 822 0 
tgtcatactt atcctgtccc ttttttttcc acagctcgcg gttgaggaca aactcttcgc 8280 
ggtctttcca gtactcttgg atcggaaacc cgtcggcctc cgaacggtaa gagcctagca 834 0 
tgtagaactg gttgacggcc tggtaggcgc agcatccctt ttctacgggt agcgcgtatg 8400 
cctgcgcggc cttccggagc gaggtgtggg tgagcgcaaa ggtgtccctg accatgactt 8460 
tgaggtactg gtatttgaag tcagtgtcgt cgcatccgcc ctgctcccag agcaaaaagt 8520 
ccgtgcgctt tttggaacgc ggatttggca gggcgaaggt gacatcgttg aagagtatct 8580 
ttcccgcgcg aggcataaag ttgcgtgtga tgcggaaggg tcccggcacc tcggaacggt 8640 
tgttaattac ctgggcggcg agcacgatct cgtcaaagcc gttgatgttg tggcccacaa 8700 
tgtaaagttc caagaagcgc gggatgccct tgatggaagg caatttttta agttcctcgt 8760 
aggtgagctc ttcaggggag ctgagcccgt gctctgaaag ggcccagtct gcaagatgag 882 0 
ggttggaagc gacgaatgag ctccacaggt cacgggccat tagcatttgc aggtggtcgc 8 880 
gaaaggtcct aaactggcga cctatggcca ttttttctgg ggtgatgcag tagaaggtaa 8940 
gcgggtcttg ttcccagcgg tcccatccaa ggttcgcggc taggtctcgc gcggcagtca 9000 
ctagaggctc atctccgccg aacttcatga ccagcatgaa gggcacgagc tgcttcccaa 9060 
aggcccccat ccaagtatag gtctctacat cgtaggtgac aaagagacgc tcggtgcgag 912 0 
gatgcgagcc gatcgggaag aactggatct cccgccacca attggaggag tggctattga 9180 
tgtggtgaaa gtagaagtcc ctgcgacggg ccgaacactc gtgctggctt ttgtaaaaac 9240 
gtgcgcagta ctggcagcgg tgcacgggct gtacatcctg cacgaggttg acctgacgac 93 00 
cgcgcacaag gaagcagagt gggaatttga gcccctcgcc tggcgggttt ggctggtggt 9360 
cttctacttc ggctgcttgt ccttgaccgt ctggctgctc gaggggagtt acggtggatc 942 0 
ggaccaccac gccgcgcgag cccaaagtcc agatgtccgc gcgcggcggt cggagcttga 9480 
tgacaacatc gcgcagatgg gagctgtcca tggtctggag ctcccgcggc gtcaggtcag 9540 
gcgggagctc ctgcaggttt acctcgcata gacgggtcag ggcgcgggct agatccaggt 9600 
gatacctaat ttccaggggc tggttggtgg cggcgtcgat ggcttgcaag aggccgcatc 9660 
cccgcggcgc gactacggta ccgcgcggcg ggcggtgggc cgcgggggtg tccttggatg 972 0 
atgcatctaa aagcggtgac gcgggcgagc ccccggaggt agggggggct ccggacccgc 9780 
cgggagaggg ggcaggggca cgtcggcgcc gcgcgcgggc aggagctggt gctgcgcgcg 984 0 
taggttgctg gcgaacgcga cgacgcggcg gttgatctcc tgaatctggc gcctctgcgt 9900 
gaagacgacg ggcccggtga gcttgagcct gaaagagagt tcgacagaat caatttcggt 9960 
gtcgttgacg gcggcctggc gcaaaatctc ctgcacgtct cctgagttgt cttgataggc 10020 
gatctcggcc atgaactgct cgatctcttc ctcctggaga tctccgcgtc cggctcgctc 10080 
cacggtggcg gcgaggtcgt tggaaatgcg ggccatgagc tgcgagaagg cgttgaggcc 10140 
tccctcgttc cagacgcggc tgtagaccac gcccccttcg gcatcgcggg cgcgcatgac 10200 
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cacctgcgcg agattgagct ccacgtgccg ggcgaagacg gcgtagtttc gcaggcgctg 10260 
aaagaggtag ttgagggtgg tggcggtgtg ttctgccacg aagaagtaca taacccagcg 10320 
tcgcaacgtg gattcgttga tatcccccaa ggcctcaagg cgctccatgg cctcgtagaa 103 8 0 
gtccacggcg aagttgaaaa actgggagtt gcgcgccgac acggttaact cctcctccag 1044 0 
aagacggatg agctcggcga cagtgtcgcg cacctcgcgc tcaaaggcta caggggcctc 10500 
ttcttcttct tcaatctcct cttccataag ggcctcccct tcttcttctt ctggcggcgg 10560 
tgggggaggg gggacacggc ggcgacgacg gcgcaccggg aggcggtcga caaagcgctc 10620 
gatcatctcc ccgcggcgac ggcgcatggt ctcggtgacg gcgcggccgt tctcgcgggg 10680 
gcgcagttgg aagacgccgc ccgtcatgtc ccggttatgg gttggcgggg ggctgccatg 10740 
cggcagggat acggcgctaa cgatgcatct caacaattgt tgtgtaggta ctccgccgcc 10800 
gagggacctg agcgagtccg catcgaccgg atcggaaaac ctctcgagaa aggcgtctaa 10860 
ccagtcacag tcgcaaggta ggctgagcac cgtggcgggc ggcagcgggc ggcggtcggg 10920 
gttgtttctg gcggaggtgc tgctgatgat gtaattaaag taggcggtct tgagacggcg 10980 
gatggtcgac agaagcacca tgtccttggg tccggcctgc tgaatgcgca ggcggtcggc 11040 
catgccccag gcttcgtttt gacatcggcg caggtctttg tagtagtctt gcatgagcct 11100 
ttctaccggc acttcttctt ctccttcctc ttgtcctgca tctcttgcat ctatcgctgc 11160 
ggcggcggcg gagtttggcc gtaggtggcg ccctcttcct cccatgcgtg tgaccccgaa 11220 
gcccctcatc ggctgaagca gggctaggtc ggcgacaacg cgctcggcta atatggcctg 11280 
ctgcacctgc gtgagggtag actggaagtc atccatgtcc acaaagcggt ggtatgcgcc 11340 
cgtgttgatg gtgtaagtgc agttggccat aacggaccag ttaacggtct ggtgacccgg 11400 
ctgcgagagc tcggtgtacc tgagacgcga gtaagccctc gagtcaaata cgtagtcgtt 11460 
gcaagtccgc accaggtact ggtatcccac caaaaagtgc ggcggcggct ggcggtagag 11520 
gggccagcgt agggtggccg gggctccggg ggcgagatct tccaacataa ggcgatgata 11580 
tccgtagatg tacctggaca tccaggtgat gccggcggcg gtggtggagg cgcgcggaaa 11640 
gtcgcggacg cggttccaga tgttgcgcag cggcaaaaag tgctccatgg tcgggacgct 11700 
ctggccggtc aggcgcgcgc aatcgttgac gctctagacc gtgcaaaagg agagcctgta 11760 
agcgggcact cttccgtggt ctggtggata aattcgcaag ggtatcatgg cggacgaccg 1182 0 
gggttcgagc cccgtatccg gccgtccgcc gtgatccatg cggttaccgc ccgcgtgtcg 118 80 
aacccaggtg tgcgacgtca gacaacgggg gagtgctcct tttggcttcc ttccaggcgc 11940 
ggcggctgct gcgctagctt ttttggccac tggccgcgcg cagcgtaagc ggttaggctg 120 00 
gaaagcgaaa gcattaagtg gctcgctccc tgtagccgga gggttatttt ccaagggttg 12060 
agtcgcggga cccccggttc gagtctcgga ccggccggac tgcggcgaac gggggtttgc 12120 
ctccccgtca tgcaagaccc cgcttgcaaa ttcctccgga aacagggacg agcccctttt 12180 
ttgcttttcc cagatgcatc cggtgctgcg gcagatgcgc ccccctcctc agcagcggca 1224 0 
agagcaagag cagcggcaga catgcagggc accctcccct cctcctaccg cgtcaggagg 123 0 0 
ggcgacatcc gcggttgacg cggcagcaga tggtgattac gaacccccgc ggcgccgggc 12360 
ccggcactac ctggacttgg aggagggcga gggcctggcg cggctaggag cgccctctcc 12420 
tgagcggtac ccaagggtgc agctgaagcg tgatacgcgt gaggcgtacg tgccgcggca 124 80 
gaacctgttt cgcgaccgcg agggagagga gcccgaggag atgcgggatc gaaagttcca 12540 
cgcagggcgc gagctgcggc atggcctgaa tcgcgagcgg ttgctgcgcg aggaggactt 12600 
tgagcccgac gcgcgaaccg ggattagtcc cgcgcgcgca cacgtggcgg ccgccgacct 12660 
ggtaaccgca tacgagcaga cggtgaacca ggagattaac tttcaaaaaa gctttaacaa 1272 0 
ccacgtgcgt acgcttgtgg cgcgcgagga ggtggctata ggactgatgc atctgtggga 12780 
ctttgtaagc gcgctggagc aaaacccaaa tagcaagccg ctcatggcgc agctgttcct 12840 
tatagtgcag cacagcaggg acaacgaggc attcagggat gcgctgctaa acatagtaga 12900 
gcccgagggc cgctggctgc tcgatttgat aaacatcctg cagagcatag tggtgcagga 12960 
gcgcagcttg agcctggctg acaaggtggc cgccatcaac tattccatgc ttagcctggg 1302 0 
caagttttac gcccgcaaga tataccatac cccttacgtt cccatagaca aggaggtaaa 13080 
gatcgagggg ttctacatgc gcatggcgct gaaggtgctt accttgagcg acgacctggg 13140 
cgtttatcgc aacgagcgca tccacaaggc cgtgagcgtg agccggcggc gcgagctcag 13200 
cgaccgcgag ctgatgcaca gcctgcaaag ggccctggct ggcacgggca gcggcgatag 13260 
agaggccgag tcctactttg acgcgggcgc tgacctgcgc tgggccccaa gccgacgcgc 13320 
cctggaggca gctggggccg gacctgggct ggcggtggca cccgcgcgcg ctggcaacgt 13380 
cggcggcgtg gaggaatatg acgaggacga tgagtacgag ccagaggacg gcgagtacta 13440 
agcggtgatg tttctgatca gatgatgcaa gacgcaacgg acccggcggt gcgggcggcg 13500 
ctgcagagcc agccgtccgg ccttaactcc acggacgact ggcgccaggt catggaccgc 13560 
atcatgtcgc tgactgcgcg caatcctgac gcgttccggc agcagccgca ggccaaccgg 13620 
ctctccgcaa ttctggaagc ggtggtcccg gcgcgcgcaa accccacgca cgagaaggtg 13680 
ctggcgatcg taaacgcgct ggccgaaaac agggccatcc ggcccgacga ggccggcctg 13740 
gtctacgacg cgctgcttca gcgcgtggct cgttacaaca gcggcaacgt gcagaccaac 13800 
ctggaccggc tggtggggga tgtgcgcgag gccgtggcgc agcgtgagcg cgcgcagcag 13860 
cagggcaacc tgggctccat ggttgcacta aacgccttcc tgagtacaca gcccgccaac 13920 
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gtgccgcggg gacaggagga ctacaccaac tttgtgagcg cactgcggct aatggtgact 13980 
gagacaccgc aaagtgaggt gtaccagtct gggccagact attttttcca gaccagtaga 14 04 0 
caaggcctgc agaccgtaaa cctgagccag gctttcaaaa acttgcaggg gctgtggggg 1410 0 
gtgcgggctc ccacaggcga ccgcgcgacc gtgtctagct tgctgacgcc caactcgcgc 14160 
ctgttgctgc tgctaatagc gcccttcacg gacagtggca gcgtgtcccg ggacacatac 14220 
ctaggtcact tgctgacact gtaccgcgag gccataggtc aggcgcatgt ggacgagcat 14280 
actttccagg agattacaag tgtcagccgc gcgctggggc aggaggacac gggcagcctg 14340 
gaggcaaccc taaactacct gctgaccaac cggcggcaga agatcccctc gttgcacagt 1440 0 
ttaaacagcg aggaggagcg cattttgcgc tacgtgcagc agagcgtgag ccttaacctg 14460 
atgcgcgacg gggtaacgcc cagcgtggcg ctggacatga ccgcgcgcaa catggaaccg 14520 
ggcatgtatg cctcaaaccg gccgtttatc aaccgcctaa tggactactt gcatcgcgcg 14580 
gccgccgtga accccgagta tttcaccaat gccatcttga acccgcactg gctaccgccc 14640 
cctggtttct acaccggggg attcgaggtg cccgagggta acgatggatt cctctgggac 14700 
gacatagacg acagcgtgtt ttccccgcaa ccgcagaccc tgctagagtt gcaacagcgc 14760 
gagcaggcag aggcggcgct gcgaaaggaa agcttccgca ggccaagcag cttgtccgat 14820 
ctaggcgctg cggccccgcg gtcagatgct agtagcccat ttccaagctt gatagggtct 14880 
cttaccagca ctcgcaccac ccgcccgcgc ctgctgggcg aggaggagta cctaaacaac 14 940 
tcgctgctgc agccgcagcg cgaaaaaaac ctgcctccgg catttcccaa caacgggata 15000 
gagagcctag tggacaagat gagtagatgg aagacgtacg cgcaggagca cagggacgtg 15060 
ccaggcccgc gcccgcccac ccgtcgtcaa aggcacgacc gtcagcgggg tctggtgtgg 15120 
gaggacgatg actcggcaga cgacagcagc gtcctggatt tgggagggag tggcaacccg 15180 
tttgcgcacc ttcgccccag gctggggaga atgttttaaa aaaaaaaaag catgatgcaa 15240 
aataaaaaac tcaccaaggc catggcaccg agcgttggtt ttcttgtatt ccccttagta 15300 
tgcggcgcgc ggcgatgtat gaggaaggtc ctcctccctc ctacgagagt gtggtgagcg 153 60 
cggcgccagt ggcggcggcg ctgggttctc ccttcgatgc tcccctggac ccgccgtttg 15420 
tgcctccgcg gtacctgcgg cctaccgggg ggagaaacag catccgttac tctgagttgg 15480 
cacccctatt cgacaccacc cgtgtgtacc tggtggacaa caagtcaacg gatgtggcat 15540 
ccctgaacta ccagaacgac cacagcaact ttctgaccac ggtcattcaa aacaatgact 15600 
acagcccggg ggaggcaagc acacagacca tcaatcttga cgaccggtcg cactggggcg 15660 
gcgacctgaa aaccatcctg cataccaaca tgccaaatgt gaacgagttc atgtttacca 15720 
ataagtttaa ggcgcgggtg atggtgtcgc gcttgcctac taaggacaat caggtggagc 15780 
tgaaatacga gtgggtggag ttcacgctgc ccgagggcaa ctactccgag accatgacca 15840 
tagaccttat gaacaacgcg atcgtggagc actacttgaa agtgggcaga cagaacgggg 15900 
ttctggaaag cgacatcggg gtaaagtttg acacccgcaa cttcagactg gggtttgacc 15960 
ccgtcactgg tcttgtcatg cctggggtat atacaaacga agccttccat ccagacatca 16020 
ttttgctgcc aggatgcggg gtggacttca cccacagccg cctgagcaac ttgttgggca 16080 
tccgcaagcg gcaacccttc caggagggct ttaggatcac ctacgatgat ctggagggtg 16140 
gtaacattcc cgcactgttg gatgtggacg cctaccaggc gagcttgaaa gatgacaccg 16200 
aacagggcgg gggtggcgca ggcggcagca acagcagtgg cagcggcgcg gaagagaact 16260 
ccaacgcggc agccgcggca atgcagccgg tggaggacat gaacgatcat gccattcgcg 16320 
gcgacacctt tgccacacgg gctgaggaga agcgcgctga ggccgaagca gcggccgaag 163 80 
ctgccgcccc cgctgcgcaa cccgaggtcg agaagcctca gaagaaaccg gtgatcaaac 16440 
ccctgacaga ggacagcaag aaacgcagtt acaacctaat aagcaatgac agcaccttca 16500 
cccagtaccg cagctggtac cttgcataca actacggcga ccctcagacc ggaatccgct 16560 
catggaccct gctttgcact cctgacgtaa cctgcggctc ggagcaggtc tactggtcgt 16620 
tgccagacat gatgcaagac cccgtgacct tccgctccac gcgccagatc agcaactttc 16680 
cggtggtggg cgccgagctg ttgcccgtgc actccaagag cttctacaac gaccaggccg 16740 
tctactccca actcatccgc cagtttacct ctctgaccca cgtgttcaat cgctttcccg 16SQ0 
agaaccagat tttggcgcgc ccgccagccc ccaccatcac caccgtcagt gaaaacgttc 16860 
ctgctctcac agatcacggg acgctaccgc tgcgcaacag catcggagga gtccagcgag 16920 
tgaccattac tgacgccaga cgccgcacct gcccctacgt ttacaaggcc ctgggcatag 16980 
tctcgccgcg cgtcctatcg agccgcactt tttgagcaag catgtccatc cttatatcgc 17040 
ccagcaataa cacaggctgg ggcctgcgct tcccaagcaa gatgtttggc ggggccaaga 17100 
agcgctccga ccaacaccca gtgcgcgtgc gcgggcacta ccgcgcgccc tggggcgcgc 17160 
acaaacgcgg ccgcactggg cgcaccaccg tcgatgacgc catcgacgcg gtggtggagg 17220 
aggcgcgcaa ctacacgccc acgccgccac cagtgtccac agtggacgcg gccattcaga 17280 
ccgtggtgcg cggagcccgg cgctatgcta aaatgaagag acggcggagg cgcgtagcac 17340 
gtcgccaccg ccgccgaccc ggcactgccg cccaacgcgc ggcggcggcc ctgcttaacc 17400 
gcgcacgtcg caccggccga cgggcggcca tgcgggccgc tcgaaggctg gccgcgggta 17460 
ttgtcactgt gccccccagg tccaggcgac gagcggccgc cgcagcagcc gcggccatta 17520 
gtgctatgac tcagggtcgc aggggcaacg tgtattgggt gcgcgactcg gttagcggcc 17580 
tgcgcgtgcc cgtgcgcacc cgccccccgc gcaactagat tgcaagaaaa aactacttag 17640 
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actcgtactg 
aaatcaaaga 
aagagcagga 
atgatgaact 
agtggaaagg 
ccggtgagcg 
acctgcttga 
acatgctggc 
tgcagcaggt 
ctggtgactt 
tcttggaaaa 
agcaggtggc 
gcaccagtat 
cggtggcgga 
tgcaaacgga 
agtacggcgc 
cccccggcta 
ccaccactgg 
tgcgcagggt 
ccagcatcgt 
tccgtttccc 
acggcctgac 
gcatgcgcgg 
tgcccggaat 
tgtggaaaaa 
tgtagaatgg 
atgggaaact 
tcgctgtgga 
tggaacagca 
aaggtggtag 
gtgcaaaata 
gccgtggaga 
gaaactctgg 
ctgcccacca 
gtaacgctgg 
accgccgttg 
cgatcgttgc 
ctgggggtgc 
tgtcatgtat 
ccaagatggc 
acgcctcgga 
tcagcctgaa 
accggtccca 
cgtacaaggc 
cgtactttga 
ctgcctacaa 
ctactgctct 
agcaagctga 
ttacaaagga 
catttcaacc 
cagctgggag 
aacccacaaa 
aaagtcaagt 
acttgactcc 
atatttctta 
ctatgcccaa 
acaacagcac 
tagatttgca 
atagaaccag 
gaattattga 
gtgtgattaa 
gggaaaaaga 



ttgtatgtat 
agagatgctc 
ttacaagccc 
tgacgacgag 
tcgacgcgta 
ctccacccgc 
gcaggccaac 
gttgccgctg 
gctgcccgcg 
ggcacccacc 
aatgaccgtg 
gccgggactg 
tgccaccgcc 
tgccgcggtg 
cccgtggatg 
cgccagcgcg 
tcgtggctac 
aacccgccgc 
ggctcgcgaa 
ttaaaagccg 
ggtgccggga 

gggcggcatg 

cggtatcctg 
tgcatccgtg 
tcaaaataaa 
aagacatcaa 
ggcaagatat 
gcggcattaa 
gcacaggcca 
atggcctggc 
agattaacag 
cagtgtctcc 
tgacgcaaat 
cccgtcccat 
acctgcctcc 
ttgtaacccg 
ggcccgtagc 
aatccctgaa 
gcgtccatgt 
taccccttcg 
gtacctgagc 
taacaagttt 
gcgtttgacg 
gcggttcacc 
catccgcggc 
cgccctggct 
tgaaataaac 
gcagcaaaaa 

gggtattcaa 

tgaacctcaa 
agtccttaaa 
tgaaaatgga 
ggaaatgcaa 
taaagtggta 
catgcccact 
caggcctaat 
gggtaatatg 
agacagaaac 
gtacttttct 
aaatcatgga 
tacagagact 
tgctacagaa 



ccagcggcgg 
caggtcatcg 
cgaaagctaa 
gtggaactgc 
aaacgtgttt 
acctacaagc 
gagcgcctcg 
gacgagggca 
cttgcaccgt 
gtgcagctga 
gaacctgggc 
ggcgtgcaga 
acagagggca 
caggcggtcg 
tttcgcgttt 
ctactgcccg 
acctaccgcc 
cgccgtcgcc 
ggaggcagga 
gtctttgtgg 
ttccgaggaa 
cgtcgtgcgc 
cccctcctta 
gccttgcagg 
aagtctggac 
ctttgcgtct 
cggcaccagc 
aaatttcggt 
gatgctgagg 
ctctggcatt 
taagcttgat 
agaggggcgt 
agacgagcct 
cgcgcccatg 
ccccgccgac 
tcctagccgc 
cagtggcaac 
gcgccgacga 
cgccgccaga 
atgatgccgc 
cccgggctgg 
agaaacccca 
ctgcggttca 
ctagctgtgg 
gtgctggaca 
cccaagggtg 
ctagaagaag 
actcacgtat 
ataggtgtcg 
ataggagaat 
aagactaccc 
gggcaaggca 
tttttctcaa 
ttgtacagtg 
attaaggaag 
tacattgctt 
ggtgttctgg 
acagagcttt 
atgtggaatc 
actgaagatg 
cttaccaagg 
ttttcagata 



cggcgcgcaa 
cgccggagat 

a g°gggtcaa 

tgcacgctac 
tgcgacccgg 
gcgtgtatga 

gggagtttgc 
acccaacacc 
ccgaagaaaa 
tggtacccaa 
tggagcccga 
ccgtggacgt 
tggagacaca 
ctgcggccgc 
cagccccccg 
aatatgccct 
ccagaagacg 
gtcgccagcc 
ccctggtgct 
ttcttgcaga 
gaatgcaccg 
accaccggcg 
ttccactgat 
cgcagagaca 
tctcacgctc 
ctggccccgc 
aatatgagcg 
tccaccgtta 
gataagttga 
agcggggtgg 
ccccgccctc 
ggcgaaaagc 
ccctcgtacg 
gctaccggag 
acccagcaga 
gcgtccctgc 
tggcaaagca 
tgcttctgaa 
ggagctgctg 
agtggtctta 
tgcagtttgc 
cggtggcgcc 
tccctgtgga 
gtgataaccg 

ggggccctac 

ccccaaatcc 
aggacgatga 
ttgggcaggc 
aaggtcaaac 
ctcagtggta 
caatgaaacc 
ttcttgtaaa 
ctactgaggc 
aagatgtaga 
gtaactcacg 
ttagggacaa 
cgggccaagc 
cataccagct 
aggctgttga 
aacttccaaa 
taaaacctaa 
aaaatgaaat 



cgaagctatg 
ctatggcccc 
aaagaaaaag 
cgcgcccagg 
caccaccgta 
tgaggtgtac 
ctacggaaag 
tagcctaaag 
gcgcggccta 
gcgccagcga 
ggtccgcgtg 
tcagataccc 
aacgtccccg 
gtccaagacc 
gcgcccgcgc 
acatccttcc 
agcaactacc 
cgtgctggcc 
gccaacagcg 
tatggccctc 
taggaggggc 
gcggcgcgcg 
cgccgcggcg 
ctgattaaaa 
gcttggtcct 
gacacggctc 
gtggcgcctt 
agaactatgg 
aagagcaaaa 
tggacctggc 
ccgtagagga 
gtccgcgccc 
aggaggcact 
tgctgggcca 
aacctgtgct 
gccgcgccgc 
cactgaacag 
tagctaacgt 
agccgccgcg 
catgcacatc 
ccgcgccacc 
tacgcacgac 
ccgtgaggat 
tgtgctggac 
ttttaagccc 
ttgcgaatgg 
caacgaagac 
gccttattct 
acctaaatat 
cgaaactgaa 
atgttacggt 
gcaacaaaat 
gaccgcaggc 
tatagaaacc 
agaactaatg 
ttttattggt 
atcgcagttg 
tttgcttgat 
cagctatgat 
ttactgcttt 
aacaggtcag 
aagagttgga 



tccaagcgca 
ccgaagaagg 
aaagatgatg 
cgacgggtac 
gtctttacgc 
ggcgacgagg 
cggcataagg 
cccgtaacac 
aagcgcgagt 
ctggaagatg 
cggccaatca 
actaccagta 
gttgcctcag 
tctacggagg 
ggttcgagga 
attgcgccta 
cgacgccgaa 
ccgatttccg 
cgctaccacc 
acctgccgcc 
atggccggcc 
tcgcaccgtc 
attggcgccg 
acaagttgca 
gtaactattt 
gcgcccgttc 
cagctggggc 
cagcaaggcc 
tttccaacaa 
caaccaggca 
gcctccaccg 
cgacagggaa 
aaagcaaggc 
gcacacaccc 
gccaggcccg 
cagcggtccg 
catcgtgggt 
gtcgtatgtg 
cgcccgcttt 
tcgggccagg 
gagacgtact 
gtgaccacag 
actgcgtact 
atggcttcca 
tactctggca 
gatgaagctg 
gaagtagacg 
ggtataaata 
gccgataaaa 
attaatcatg 
tcatatgcaa 
ggaaagctag 
aatggtgata 
ccagacactc 
ggccaacaat 
ctaatgtatt 
aatgctgttg 
tccattggtg 
ccagatgtta 
ccactgggag 
gaaaatggat 
aataattttg 



17700 
17760 
17820 
17880 
17940 
18000 
18060 
18120 
18180 
18240 
18300 
18360 
18420 
18480 
18540 
18600 
18660 
18720 
18780 
18840 
18900 
18960 
19020 
19080 
19140 
19200 
19260 
19320 
19380 
19440 
19500 
19560 
19620 
19680 
19740 
19800 
19860 
19920 
19980 
20040 
20100 
20160 
20220 
20280 
20340 
20400 
20460 
20520 
20580 
20640 
20700 
20760 
20820 
20880 
20940 
21000 
21060 
21120 
21180 
21240 
21300 
21360 
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ccatggaaat 
tgtatttgcc 
acacctacga 
accttggagc 
gcaatgctgg 
acatccaggt 
acacctacga 
atgacctaag 
tcttccccat 
acgaccagtc 
acgctaccaa 
ccttcacgcg 
acacctactc 
agaaggtggc 
cccccaacga 
gtaacatgac 
agggcttcta 
agcccatgag 
tcctacacca 
gacaggccta 
ttacccagaa 
ttatgtccat 
acgcgctaga 
tgtttgaagt 
tgtacctgcg 
aacaacagct 
tggttgtggg 
acacaagctc 
gatggccttt 
ttctgaccag 
cgccattgct 
ggggcccaac 
ctggccccaa 
ctccatgctc 
cagcttcctg 
cacttctttt 
aggcaaatgc 
cgccgtttaa 
gttgcgatac 
ggtgaagttt 
cgatatcttg 
gttgcagcac 
ggagatcaga 
tagctgcctt 
catcaaaagg 
gatctgctta 
gccggaaaac 
ggagatctgc 
ctccttcagc 
atttatcata 
cagccacaac 
caggtacgcc 
cagctgcaac 
cacttggtca 
catcagcgcg 
cgggttcatc 
ccgcatacca 
tttgccatgc 
ttctctttct 
agaagggcgc 
ccgcgggctg 
ctcgatacgc 



caatctaaat 
cgacaagcta 
ctacatgaac 
acgctggtcc 
cctgcgctac 
gcctcagaag 
gtggaacttc 
ggtfcgacgga 
ggcccacaac 
ctttaacgac 
cgtgcccata 
ccttaagact 
tggctctata 
cattaccttt 
gtttgaaatt 
caaagactgg 
tatcccagag 
ccgtcaggtg 
acacaacaac 
ccctgctaac 
aaagtttctt 
gggcgcactc 
catgactttt 
ctttgacgtg 
cacgcccttc 
gccgccatgg 
ccatattttt 
gcctgcgcca 
gcctggaacc 
cgactcaagc 
tcttcccccg 
tcggccgcct 
actcccatgg 
aacagtcccc 
gagcgccact 
tgtcacttga 
ttttatttgt 
aaatcaaagg 

tggtgtttag 

tcactccaca 
aagtcgcagt 
tggaacacta 
tccgcgtcca 
cccaaaaagg 
tgaccgtgcc 
aaagccacct 
tgattggccg 
accacatttc 
gcgcgctgcc 
atgcttccgt 
gcgcagcccg 
tgcaggaatc 
ccgcggtgct 
ggcagtagtt 
cgcgcagcct 
accgtaattt 
cgcgccactg 
ttgattagca 
tcctcgctgt 
ttctttttct 
ggtgtgcgcg 
cgcctcatcc 



gccaacctgt 
aagtacagtc 
aagcgagtgg 
cttgactata 
cgctcaatgt 
ttctttgcca 
aggaaggatg 
gccagcatta 
accgcctcca 
tatctctccg 
tccatcccct 
aaggaaaccc 
ccctacctag 
gactcttctg 
aagcgctcag 
ttcctggtac 
agctacaagg 
gtggatgata 
tctggatttg 
ttcccctatc 
tgcgatcgca 
acagacctgg 
gaggtggatc 
gtccgtgtgc 
tcggccggca 
gctccagtga 
tgggcaccta 
tagtcaatac 
cgcactcaaa 
aggtttacca 
accgctgtat 
gtggactatt 
atcacaaccc 
aggtacagcc 
cgccctactt 
aaaacatgta 
acactctcgg 
ggttctgccg 
tgctccactt 
ggctgcgcac 
tggggcctcc 
tcagcgccgg 
ggtcctccgc 
gcgcgtgccc 
cggtctgggc 
gagcctttgc 
gacaggccgc 
ggccccaccg 
cgttttcgct 
gtagacactt 
tgggctcgtg 
gccccatcat 
cctcgttcag 
tgaagttcgc 
ccatgccctt 
cactttccgc 
ggtcgtcttc 
ccggtgggtt 
ccacgattac 
tcttgggcgc 
gcaccagcgc 
gcttttttgg 



ggagaaattt 
cttccaacgt 
tggctcccgg 
tggacaacgt 

tgctgggcaa 

ttaaaaacct 
ttaacatggt 
agtttgatag 
cgcttgaggc 
ccgccaacat 
cccgcaactg 
catcactggg 
atggaacctt 
tcagctggcc 
ttgacgggga 
aaatgctagc 
accgcatgta 
ctaaatacaa 
ttggctacct 
cgcttatagg 
ccctttggcg 
gccaaaacct 
ccatggacga 
accggccgca 
acgccacaac 
gcaggaactg 
tgacaagcgc 
ggccggtcgc 
aacatgctac 
gtttgagtac 
aacgctggaa 
ctgctgcatg 
caccatgaac 
caccctgcgt 
ccgcagccac 
aaaataatgt 
gtgattattt 
cgcatcgcta 
aaactcaggc 
catcaccaac 
gccctgcgcg 
gtggtgcacg 
gttgctcagg 
aggctttgag 
gttaggatac 
gccttcagag 
gtcgtgcacg 
gttcttcacg 
cgtcacatcc 
aagctcgcct 
atgcttgtag 
cgtcacaaag 
ccaggtcttg 
ctttagatcg 
ctcccacgca 
ttcgctgggc 
attcagccgc 
gctgaaaccc 
ctctggtgat 
aatggccaaa 
gtcttgtgat 
gggcgcccgg 



cctgtactcc 
aaaaatttct 
gttagtggac 
caacccattt 
tggtcgctat 
ccttctcctg 
tctgcagagc 
catttgcctt 
catgcttaga 
gctctaccct 

ggcggctttc 

ctcgggctac 
ttacctcaac 
tggcaatgac 
gggttacaac 
taactacaac 
ctccttcttt 
ggactaccaa 
tgcccccacc 
caagaccgca 
catcccattc 
tctctacgcc 
gcccaccctt 
ccgcggcgtc 
ataaagaagc 
aaagccattg 
tttccaggct 
gagactgggg 
ctctttgagc 
gagtcactcc 
aagtccaccc 
tttctccacg 
cttattaccg 
cgcaaccagg 
agtgcgcaga 
actagagaca 
acccccaccc 
tgcgccactg 
acaaccatcc 
gcgtttagca 
cgcgagttgc 
ctggccagca 
gcgaacggag 
ttgcactcgc 
agcgcctgca 
aagaacatgc 
cagcaccttg 
atcttggcct 
atttcaatca 
tcgatctcag 
gtcacctctg 
gtcttgttgc 
catacggccg 
ttatccacgt 
gacacgatcg 
tcttcctctt 
cgcactgtgc 
accatttgta 
ggcgggcgct 
tccgccgccg 
gagtcttcct 
ggaggcggcg 



aacatagcgc 
gataacccaa 
tgctacatta 
aaccaccacc 
gtgcccttcc 
ccgggctcat 
tccctaggaa 
tacgccacct 
aacgacacca 
atacccgcca 
cgcggctggg 
gacccttatt 
cacaccttta 
cgcctgctta 
gttgcccagt 
attggctacc 
agaaacttcc 
caggtgggca 
atgcgcgaag 
gttgacagca 
tccagtaact 
aactccgccc 
ctttatgttt 
atcgaaaccg 
aagcaacatc 
tcaaagatct 
ttgtttctcc 
gcgtacactg 
cctttggctt 
tgcgccgtag 
aaagcgtaca 
cctttgccaa 
gggtacccaa 
aacagctcta 
ttaggagcgc 
ctttcaataa 
ttgccgtctg 
gcagggacac 
gcggcagctc 
ggtcgggcgc 
gatacacagg 
cgctcttgtc 
tcaactttgg 
accgtagtgg 
taaaagcctt 
cgcaagactt 
cgtcggtgtt 
tgctagactg 
cgtgctcctt 
cgcagcggtg 
caaacgactg 
tggtgaaggt 
ccagagcttc 
ggtacttgtc 
gcacactcag 
cctcttgcgt 
gcttacctcc 
gcgccacatc 
cgggcttggg 
aggtcgatgg 
cgtcctcgga 
gcgacgggga 



21420 
21480 
21540 
21600 
21660 
21720 
21780 
21840 
21900 
21960 
22020 
22080 
22140 
22200 
22260 
22320 
22380 
22440 
22500 
22560 
22620 
22680 
22740 
22800 
22860 
22920 
22980 
23040 
23100 
23160 
23220 
23280 
23340 
23400 
23460 
23520 
23580 
23640 
23700 
23760 
23820 
23880 
23940 
24000 
24060 
24120 
24180 
24240 
24300 
24360 
24420 
24480 
24540 
24600 
24660 
24720 
24780 
24840 
24900 
24960 
25020 
25080 
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cggggacgac acgtcctcca tggttggggg acgtcgcgcc gcaccgcgtc cgcgctcggg 25140 
ggtggtttcg cgctgctcct cttcccgact ggccatttcc ttctcctata ggcagaaaaa 25200 
gatcatggag tcagtcgaga agaaggacag cctaaccgcc ccctctgagt tcgccaccac 252 60 
cgcctccacc gatgccgcca acgcgcctac caccttcccc gtcgaggcac ccccgcttga 2532 0 
"gg a gg a 99 aa gtgattatcg agcaggaccc aggttttgta agcgaagacg acgaggaccg 25380 
ctcagtacca acagaggata aaaagcaaga ccaggacaac gcagaggcaa acgaggaaca 25440 
agtcgggcgg ggggacgaaa ggcatggcga ctacctagat gtgggagacg acgtgctgtt 25500 
gaagcatctg cagcgccagt gcgccattat ctgcgacgcg ttgcaagagc gcagcgatgt 25560 
gcccctcgcc atagcggatg tcagcctfcgc ctacgaacgc cacctattct caccgcgcgt 2562 0 
accccccaaa cgccaagaaa acggcacatg cgagcccaac ccgcgcctca acttctaccc 25680 
cgtatttgcc gtgccagagg tgcttgccac ctatcacatc tttttccaaa actgcaagat 25740 
acccctatcc tgccgtgcca accgcagccg agcggacaag cagctggcct tgcggcaggg 25800 
cgctgtcata cctgatatcg cctcgctcaa cgaagtgcca aaaatctttg agggtcttgg 25860 
acgcgacgag aagcgcgcgg caaacgctct gcaacaggaa aacagcgaaa atgaaagtca 2592 0 
ctctggagtg ttggtggaac tcgagggtga caacgcgcgc ctagccgtac taaaacgcag 25980 
catcgaggtc acccactttg cctacccggc acttaaccta ccccccaagg tcatgagcac 26040 
agtcatgagt gagctgatcg tgcgccgtgc gcagcccctg gagagggatg caaatttgca 2610 0 
agaacaaaca gaggagggcc tacccgcagt tggcgacgag cagctagcgc gctggcttca 26160 
aacgcgcgag cctgccgact tggaggagcg acgcaaacta atgatggccg cagtgctcgt 2 6220 
taccgtggag cttgagtgca tgcagcggtt ctttgctgac ccggagatgc agcgcaagct 26280 
agaggaaaca ttgcactaca cctttcgaca gggctacgta cgccaggcct gcaagatctc 2634 0 
caacgtggag ctctgcaacc tggtctccta ccttggaatt ttgcacgaaa accgccttgg 26400 
gcaaaacgtg cttcattcca cgctcaaggg cgaggcgcgc cgcgactacg tccgcgactg 26460 
cgtttactta tttctatgct acacctggca gacggccatg ggcgtttggc agcagtgctt 26520 
99 a 99 a 9tgc aacctcaagg agctgcagaa actgctaaag caaaacttga aggacctatg 2 6580 
gacggccttc aacgagcgct ccgtggccgc gcacctggcg gacatcattt tccccgaacg 26640 
cctgcttaaa accctgcaac agggtctgcc agacttcacc agtcaaagca tgttgcagaa 26700 
ctttaggaac tttatcctag agcgctcagg aatcttgccc gccacctgct gtgcacttcc 26760 
tagcgacttt gtgcccatta agtaccgcga atgccctccg ccgctttggg gccactgcta 26820 
ccttctgcag ctagccaact accttgccta ccactctgac ataatggaag acgtgagcgg 2 6880 
tgacggtcta ctggagtgtc actgtcgctg caacctatgc accccgcacc gctccctggt 26940 
ttgcaattcg cagctgctta acgaaagtca aattatcggt acctttgagc tgcagggtcc 27000 
ctcgcctgac gaaaagtccg cggctccggg gttgaaactc actccggggc tgtggacgtc 27060 
ggcttacctt cgcaaatttg tacctgagga ctaccacgcc cacgagatta ggttctacga 27120 
agaccaatcc cgcccgccaa atgcggagct taccgcctgc gtcattaccc agggccacat 27180 
tcttggccaa ttgcaagcca tcaacaaagc ccgccaagag tttctgctac gaaagggacg 27240 
gggggtttac ttggaccccc agtccggcga ggagctcaac ccaatccccc cgccgccgca 27300 
gccctatcag cagcagccgc gggcccttgc ttcccaggat ggcacccaaa aagaagctgc 27360 
a gctgccgcc gccacccacg gacgaggagg aatactggga cagtcaggca gaggaggttt 2742 0 
tggacgagga ggaggaggac atgatggaag actgggagag cctagacgag gaagcttccg 27480 
aggtcgaaga ggtgtcagac gaaacaccgt caccctcggt cgcattcccc tcgccggcgc 2754 0 
cccagaaatc ggcaaccggt tccagcatgg ctacaacctc cgctcctcag gcgccgccgg 27600 
cactgcccgt tcgccgaccc aaccgtagat gggacaccac tggaaccagg gccggtaagt 27660 
ccaagcagcc gccgccgtta gcccaagagc aacaacagcg ccaaggctac cgctcatggc 2772 0 
gcgggcacaa gaacgccata gttgcttgct tgcaagactg tgggggcaac atctccttcg 27780 
cccgccgctt tcttctctac catcacggcg tggccttccc ccgtaacatc ctgcattact 27840 
accgtcatct ctacagccca tactgcaccg gcggcagcgg cagcggcagc aacagcagcg 27900 
gccacacaga agcaaaggcg accggatagc aagactctga caaagcccaa gaaatccaca 27960 
gcggcggcag cagcaggagg aggagcgctg cgtctggcgc ccaacgaacc cgtatcgacc 28020 
cgcgagctta gaaacaggat ttttcccact ctgtatgcta tatttcaaca gagcaggggc 28080 
caagaacaag agctgaaaat aaaaaacagg tctctgcgat ccctcacccg cagctgcctg 28140 
tatcacaaaa gcgaagatca gcttcggcgc acgctggaag acgcggaggc tctcttcagt 28200 
aaatactgcg cgctgactct taaggactag tttcgcgccc tttctcaaat ttaagcgcga 28260 
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ttcacgcctc 
ggcattggaa 
tcgggacctc 
tcggcggacg 
ctggtccact 
gaattgcccg 
gagcttgccc 
aggggaccct 
ttaattaatt 
ttgtgttatg 
tcagtagtat 
acagaaccct 
ctccccggct 
tattccacac 
gcagctcact 
gcggttgctt 
cgtgcatcag 
gctccgtcct 
gcagcataag 
cacagtaact 
atccaaagct 
agattaagtg 
tgtaattcac 
ccatcctaaa 
tggaacaatg 
tatcaatgtt 
cccgcgttag 
tgcagggaag 
gcagcggatg 
ccctactgta 
atggaacgcc 
gatctgcgtc 
tctctcaaag 
gctgccctga 
ttctgcgagt 
ttccaaaaga 
ggtggcgtgg 
aatggcttcc 
agggtgaatc 
tcgccacctt 
aatctgctcc 
tcaggttcct 
tcccgtaggt 
gcggccactt 
ctcggagcta 
aaaatgcaag 
gtagtcatgc 
catttttctc 
aacatttaaa 
cggactacgg 
accgacagct 
tgattcatcg 
gcgtagagac 
cacataaaca 
aacatacagc 
tattaaaaaa 
caagtgcaga 
aacacccaga 
ttcctcaaat 
caattcccaa 
cgccccgcgc 
aaggtatatt 



gtcaggcaat 
ctctgcaatt 
ccggccacta 
gctacgactg 
gtcgccgcca 
aggatcatat 
gtagcctgat 
gtgttctcac 
gccacatcct 
tttcaacgtg 
agccccacca 
agtattcaac 
ggccttaaaa 
ggtttcctgt 
taagttcatg 
aacgggcggc 
gatagggcgg 
gcaggaatac 
gcgccttgtc 
gcagcacagc 
catggcgggg 
gcgacccctc 
cacctcccgg 
ccagctggcc 
acagtggaga 
ggcacaacac 
aaccatatcc 
acctcgcacg 
atcctccagt 
cggagtgcgc 
ggacgtagtc 
tccggtctcg 
catccaggcg 
taacatccac 
cacacacggg 
ttatccaaaa 
tcaaactcta 
aaaaggcaaa 
tcctctataa 
ctcaatatat 
agagcgccct 
cacagacctg 
cccttcgcag 
ccccgccagg 
tgctaaccag 
gtgctgctca 
tcatgcagat 
tcaaacatgt 
cattagaagc 
ccatgccggc 
cctcggtcat 
gtcagtgcta 
aacattacag 
cctgaaaaac 
gcttcacagc 
acaccactcg 
gcgagtatat 
aaaccgcacg 
cgtcacttcc 
cacatacaag 
cacgtcacaa 
attgatgatg 



cctaactctg 
tattgaggag 
tccggatcaa 
aatgttaagt 
caagtgcttt 
cgagggcccg 
tcgggagttt 
tgtgatttgc 
cttacacttt 
tttatttttc 
ccacatagct 
ctgccacctc 
agcatcatat 
cgagccaaac 
tcgctgtcca 
gaaggagaag 
tggtgctgca 
aacatggcag 
ctccgggcac 
accacaatat 
accacagaac 
ataaacacgc 
taccatataa 
aaaacctgcc 
gcccaggact 
aggcacacgt 
cagggaacaa 
taactcacgt 
atggtagcgc 
cgagacaacc 
atatttcctg 
ccgcttagat 
ccccctggct 
caccgcagaa 
aggagcggga 
cctcaaaatg 
cagccaaaga 
cggccctcac 
acattccagc 
ctctaagcaa 
ccaccttcag 
tataagattc 
ggccagctga 
aaccttgaca 
cgtagccccg 
aaaaatcagg 
aaaggcaggt 
ctgcgggttt 
ctgtcttaca 
gtgaccgtaa 
gtccggagtc 
aaaagcgacc 
cccccatagg 
cctcctgcct 
ggcagcctaa 
acacggcacc 
ataggactaa 
cgaacctacg 
gttttcccac 
ttactccgcc 
actccacccc 



cagacctcgt 
tttgtgccat 
tttattccta 
ggagaggcag 
gcccgcgact 
gcgcacggcg 
acccagcgcc 
aactgtccta 
ttcatacatt 
aattgcagaa 
tatacagatc 
cctcccaaca 
catgggtaac 
gctcatcagt 
gctgctgagc 
tccacgccta 
gcagcgcgcg 
tggtctcctc 
agcagcgcac 
tgttcaaaat 
ccacgtggcc 
tggacataaa 
acctctgatt 
cgccggctat 
cgtaaccatg 
gcatacactt 
cccattcctg 
tgtgcattgt 
gggtttctgt 
gagatcgtgt 
aagcaaaacc 
cgctctgtgt 
tcgggttcta 
taagccacac 
agagctggaa 
aagatctatt 
acagataatg 
gtccaagtgg 
accttcaacc 
atcccgaata 
cctcaagcag 
aaaagcggaa 
acataatcgt 
aaagaaccca 
atgtaagctt 
caaagcctcg 
aagctccgga 
ctgcataaac 
acaggaaaaa 
aaaaactggt 
ataatgtaag 
gaaatagccc 
aggtataaca 
aggcaaaata 
cagtcagcct 
agctcaatca 
aaaatgacgt 
cccagaaacg 
gttacgtaac 
ctaaaaccta 
ctcattatca 



cctctgagcc 
cggtctactt 
actttgacgc 
agcaactgcg 
ccggtgagtt 
tccggcttac 
ccctgctagt 
accttggatt 
gcccaagaat 
aatttcaagt 
accgtacctt 
cacagagtac 
agacatattc 
gatattaata 
cacaggctgc 
catgggggta 
aataaactgc 
agcgatgatt 
cctgatctca 
cccacagtgc 
atcataccac 
cattacctct 
aaacatggcg 
acactgcagg 
gatcatcatg 
cctcaggatt 
aatcagcgta 
caaagtgtta 
ctcaaaagga 
tggtcgtagt 
aggtgcgggc 
agtagttgta 
tgtaaactcc 
ccagccaacc 
gaaccatgtt 
aagtgaacgc 
gcatttgtaa 
acgtaaaggc 
atgcccaaat 
ttaagtccgg 
cgaatcatga 
cattaacaaa 
gcaggtctgc 
cactgattat 
tgttgcatgg 
cgcaaaaaag 
accaccacag 
acaaaataaa 
caacccttat 
caccgtgatt 
actcggtaaa 

aaattaatag 
gcaccctccc 
taccagtaaa 
gtcacagtgt 
aacggttaaa 
aaagccaaaa 
ttcccatttt 
cgtcacccgc 
tattggcttc 



gcgctctgga 
taaccccttc 
ggtaaaggac 
cctgaaacac 
ttgctacttt 
cgcccaggga 
tgagcgggac 
acatcaagat 
aaagaatcgt 
catttttcat 
aatcaaactc 
acagtccttt 
ttaggtgtta 
aactccccgg 
tgtccaactt 
gagtcataat 
tgccgccgcc 
cgcaccgccc 
cttaaatcag 
aaggcgctgt 
aagcgcaggt 
tttggcatgt 
ccatccacca 
gaaccgggac 
ctcgtcatga 
acaagctcct 
aatcccacac 
cattcgggca 
ggtagacgat 
gtcatgccaa 
gtgacaaaca 
gtatatccac 
ttcatgcgcc 
tacacattcg 
ttttttttta 
gctcccctcc 
gatgttgcac 
taaacccttc 
aattctcatc 
ccattgtaaa 
ttgcaaaaat 
aataccgcga 
acggaccagc 
gacacgcata 
gcggcgatat 
aaagcacatc 
aaaaagacac 
ataacaaaaa 
aagcataaga 
aaaaagcacc 
cacatcaggt 
atacccgcag 
gagagaaaaa 
gctccagaac 
aaagaaaacc 
aaaaaagggc 
gtccacaaaa 
aacccacaac 
aagaaaacta 
cccgttccca 
aatccaaaat 



28860 
28920 
28980 
29040 
29100 
29160 
29220 
29280 
29340 
29400 
29460 
29520 
29580 
29640 
29700 
29760 
29820 
29880 
29940 
30000 
30060 
30120 
30180 
30240 
30300 
30360 
30420 
30480 
30540 
30600 
30660 
30720 
30780 
30840 
30900 
30960 
31020 
31080 
31140 
31200 
31260 
31320 
31380 
31440 
31500 
31560 
31620 
31680 
31740 
31800 
31860 
31920 
31980 
32040 
32100 
32160 
32220 
32280 
32340 
32400 
32460 
32480 
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<210> 24 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 24 

ctcaacaatt gtggatccgt actcc 2 5 

<210> 25 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 25 

gtgctcagca gatcttgcga ctgtg 25 



<210> 26 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 26 

ggcgcgttcg gatccactct cttcc 25 

<210> 27 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 27 

ctacatgcta ggcagatctc gttcggag 28 



<210> 28 
<211> 1240 
<212> DNA 
<213> adenovirus 

<400> 28 

ggatccactc tcttccgcat cgctgtctgc 
ctgaaaagcg ggcatgactt ctgcgctaag 
gatattcacc tggcccgcgg tgatgccttt 
gacaatcttt ttgttgtcaa gcttggtggc 
cttggcgatg gagcgcaggg tttggttttt 
gtttagctgc acgtattcgc gcgcaacgca 
gtcgggcacc aggtgcacgc gccaaccgcg 
ggctacctct ccgcgtaggc gctcgttggt 
gaatggcggt agggggtcta gctgcgtctc 
cccgggcagc aggcgcgcgt cgaagtagtc 



gagggccagc tgttggggtg agtactccct 60 
attgtcagtt tccaaaaacg aggaggattt 120 
gagggtggcc gcatccatct ggtcagaaaa 180 
aaacgacccg tagagggcgt tggacagcaa 240 
gtcgcgatcg gcgcgctcct tggccgcgat 3 00 
ccgccattcg ggaaagacgg tggtgcgctc 360 
gttgtgcagg gtgacaaggt caacgctggt 420 
ccagcagagg cggccgccct tgcgcgagca 4 80 
gtccgggggg tctgcgtcca cggtaaagac 540 
tatcttgcat ccttgcaagt ctagcgcctg 600 
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ctgccatgcg cgggcggcaa gcgcgcgctc gtatgggttg agtgggggac cccatggcat 660 
ggggtgggtg agcgcggagg cgtacatgcc gcaaatgtcg taaacgtaga ggggctctct 720 
gagtattcca agatatgtag ggtagcatct tccaccgcgg atgctggcgc gcacgtaatc 780 
gtatagttcg tgcgagggag cgaggaggtc gggaccgagg ttgctacggg cgggctgctc 840 
tgctcggaag actatctgcc tgaagatggc atgtgagttg gatgatatgg ttggacgctg 900 
gaagacgttg aagctggcgt ctgtgagacc taccgcgtca cgcacgaagg aggcgtagga 960 
gtcgcgcagc ttgttgacca gctcggcggt gacctgcacg tctagggcgc agtagtccag 102 0 
ggtttccttg atgatgtcat acttatcctg tccctttttt ttccacagct cgcggttgag 1080 
gacaaactct- tcgcggtctt tccagtactc ttggatcgga aacccgtcgg cctccgaacg 114 0 
agatccgtac tccgccgccg agggacctga gcgagtccgc atcgaccgga tcggaaaacc 1200 
tctcgagaaa ggcgtctaac cagtcacagt cgcaagatct 1240 

<210> 29 
<211> 8383 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid pDV60 
<400> 29 

gacggatcgg gagatctccc gatcccctat ggtcgactct cagtacaatc tgctctgatg 60 
ccgcatagtt aagccagtat ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg 120 
cgagcaaaat ttaagctaca acaaggcaag gcttgaccga caattgcatg aagaatctgc 180 
ttagggttag gcgttttgcg ctgcttcgcg atgtacgggc cagatatacg cgttgacatt 240 
gattattgac tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata 3 00 
tggagttccg cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc 360 
cccgcccatt gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc 420 
attgacgtca atgggtggac tatttacggt aaactgccca cttggcagta catcaagtgt 480 
atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 540 
atgcccagta catgacctta tgggactttc ctacttggca gtacatctac gtattagtca 600 
tcgctattac catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg 660 
actcacgggg atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc 720 
aaaatcaacg ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg 780 
gtaggcgtgt acggtgggag gtctatataa gcagagctct ctggctaact agagaaccca 840 
ctgcttactg gcttatcgaa attaatacga ctcactatag ggagacccaa gcttggtacc 900 
gagctcggat ccactctctt ccgcatcgct gtctgcgagg gccagctgtt ggggtgagta 960 
ctccctctga aaagcgggca tgacttctgc gctaagattg tcagtttcca aaaacgagga 102 0 
ggatttgata ttcacctggc "ccgcggtgat gcctttgagg gtggccgcat ccatctggtc 1080 
agaaaagaca atctttttgt tgtcaagctt ggtggcaaac gacccgtaga gggcgttgga 1140 
cagcaacttg gcgatggagc gcagggtttg gtttttgtcg cgatcggcgc gctccttggc 1200 
cgcgatgttt agctgcacgt attcgcgcgc aacgcaccgc cattcgggaa agacggtggt 1260 
gcgctcgtcg ggcaccaggt gcacgcgcca accgcggttg tgcagggtga caaggtcaac 132 0 
gctggtggct acctctccgc gtaggcgctc gttggtccag cagaggcggc cgcccttgcg 1380 
cgagcagaat ggcggtaggg ggtctagctg cgtctcgtcc ggggggtctg cgtccacggt 1440 
aaagaccccg ggcagcaggc gcgcgtcgaa gtagtctatc ttgcatcctt gcaagtctag 1500 
cgcctgctgc catgcgcggg cggcaagcgc gcgctcgtat gggttgagtg ggggacccca 1560 
tggcatgggg tgggtgagcg cggaggcgta catgccgcaa atgtcgtaaa cgtagagggg 162 0 
ctctctgagt attccaagat atgtagggta gcatcttcca ccgcggatgc tggcgcgcac 1680 
gtaatcgtat agttcgtgcg agggagcgag gaggtcggga ccgaggttgc tacgggcggg 1740 
ctgctctgct cggaagacta tctgcctgaa gatggcatgt gagttggatg atatggttgg 1800 
acgctggaag acgttgaagc tggcgtctgt gagacctacc gcgtcacgca cgaaggaggc 1860 
gtaggagtcg cgcagcttgt tgaccagctc ggcggtgacc tgcacgtcta gggcgcagta 1920 
gtccagggtt tccttgatga tgtcatactt atcctgtccc ttttttttcc acagctcgcg 1980 
gttgaggaca aactcttcgc ggtctttcca gtactcttgg atcggaaacc cgtcggcctc 2040 
cgaacgagat ccgtactccg ccgccgaggg acctgagcga gtccgcatcg accggatcgg 2100 
aaaacctctc gagaaaggcg tctaaccagt cacagtcgca agatccaaga tgaagcgcgc 2160 
aagaccgtct gaagatacct tcaaccccgt gtatccatat gacacggaaa ccggtcctcc 2220 
aactgtgcct tttcttactc ctccctttgt atcccccaat gggtttcaag agagtccccc 2280 
tggggtactc tctttgcgcc tatccgaacc tctagttacc tccaatggca tgcttgcgct 2340 
caaaatgggc aacggcctct ctctggacga ggccggcaac cttacctccc aaaatgtaac 2400 
cactgtgagc ccacctctca aaaaaaccaa gtcaaacata aacctggaaa tatctgcacc 2460 
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cctcacagtt acctcagaag ccctaactgt 
caacacactc accatgcaat cacaggcccc 
tgccacccaa ggacccctca cagtgtcaga 
cctcaccacc accgatagca gtacccttac 
cactggtagc ttgggcattg acttgaaaga 
actaaagtac ggggctcctt tgcatgtaac 
tggtccaggt gtgactatta ataatacttc 
ttttgattca caaggcaata tgcaacttaa 
aaacagacgc cttatacttg atgttagtta 
aagactagga cagggccctc tttttataaa 
caaaggcctt tacttgttta cagcttcaaa 
cactgccaag gggttgatgt ttgacgctac 
tgaatttggt tcacctaatg caccaaacac 
cctagaattt gattcaaaca aggctatggt 
cagcacaggt gccattacag taggaaacaa 
accagctcca tctcctaact gtagactaaa 
cttaacaaaa tgtggcagtc aaatacttgc 
tttggctcca atatctggaa cagttcaaag 
tggagtgcta ctaaacaatt ccttcctgga 
tcttactgaa ggcacagcct atacaaacgc 
tccaaaatct cacggtaaaa ctgccaaaag 
agacaaaact aaacctgtaa cactaaccat 
cacaactcca agtgcatact ctatgtcatt 
taatgaaata tttgccacat cctcttacac 
gcggccgctc gagcatgcat ctagagggcc 
ctcgctgatc agcctcgact gtgccttcta 
ccgtgccttc cttgaccctg gaaggtgcca 
aaattgcatc gcattgtctg agtaggtgtc 
acagcaaggg ggaggattgg gaagacaata 
tggcttctga ggcggaaaga accagctggg 
gcggcgcatt aagcgcggcg ggtgtggtgg 
gcgccctagc gcccgctcct ttcgctttct 
ttccccgtca agctctaaat cggggcatcc 
acctcgaccc caaaaaactt gattagggtg 
agacggtttt tcgccctttg acgttggagt 
aaactggaac aacactcaac cctatctcgg 
ggatttcggc ctattggtta aaaaatgagc 
tctgtggaat gtgtgtcagt tagggtgtgg 
gtatgcaaag catgcatctc aattagtcag 
cagcaggcag aagtatgcaa agcatgcatc 
taactccgcc catcccgccc ctaactccgc 
gactaatttt ttttatttat gcagaggccg 
agtagtgagg aggctttttt ggaggcctag 
tatccatttt cggatctgat caagagacag 
atggattgca cgcaggttct ccggccgctt 
cacaacagac aatcggctgc tctgatgccg 
cggttctttt tgtcaagacc gacctgtccg 
cgcggctatc gtggctggcc acgacgggcg 
ctgaagcggg aagggactgg ctgctattgg 
ctcaccttgc tcctgccgag aaagtatcca 
cgcttgatcc ggctacctgc ccattcgacc 
gtactcggat ggaagccggt cttgtcgatc 
tcgcgccagc cgaactgttc gccaggctca 
tcgtgaccca tggcgatgcc tgcttgccga 
gattcatcga ctgtggccgg ctgggtgtgg 
cccgtgatat tgctgaagag cttggcggcg 
gtatcgccgc tcccgattcg cagcgcatcg 
gagcgggact ctggggttcg aaatgaccga 
tttcgattcc accgccgcct tctatgaaag 
cggctggatg atcctccagc gcggggatct 
gtttattgca gcttataatg gttacaaata 
agcatttttt tcactgcatt ctagttgtgg 



ggctgccgcc gcacctctaa tggtcgcggg 2 52 0 
gctaaccgtg cacgactcca aacttagcat 258 0 
aggaaagcta gccctgcaaa catcaggccc 264 0 
tatcactgcc tcaccccctc taactactgc 2700 
gcccatttat acacaaaatg gaaaactagg 2760 
agacgaccta aacactttga ccgtagcaac 2 82 0 
cttgcaaact aaagttactg gagccttggg 288 0 
tgtagcagga ggactaagga ttgattctca 294 0 
tccgtttgat gctcaaaacc aactaaatct 3 00 0 
ctcagcccac aacttggata ttaactacaa 3060 
caattccaaa aagcttgagg ttaacctaag 312 0 
agccatagcc attaatgcag gagatgggct 3180 
aaatcccctc aaaacaaaaa ttggccatgg 3240 
tcctaaacta ggaactggcc ttagttttga 3300 
aaataatgat aagctaactt tgtggaccac 3360 
tgcagagaaa gatgctaaac tcactttggt 3420 
tacagtttca gttttggctg ttaaaggcag 3480 
tgctcatctt attataagat ttgacgaaaa 3540 
cccagaatat tggaacttta gaaatggaga 360 0 
tgttggattt atgcctaacc tatcagctta 3 66 0 
taacattgtc agtcaagttt acttaaacgg 3 720 
tacactaaac ggtacacagg aaacaggaga 3 780 
ttcatgggac tggtctggcc acaactacat 384 0 
tttttcatac attgcccaag aataaaagaa 3900 
ctattctata gtgtcaccta aatgctagag 3960 
gttgccagcc atctgttgtt tgcccctccc 4020 
ctcccactgt cctttcctaa taaaatgagg 4080 
attctattct ggggggtggg gtggggcagg 414 0 
gcaggcatgc tggggatgcg gtgggctcta 4200 
gctctagggg gtatccccac gcgccctgta 4260 
ttacgcgcag cgtgaccgct acacttgcca 4320 
tcccttcctt tctcgccacg ttcgccggct 4380 
ctttagggtt ccgatttagt gctttacggc 4440 
atggttcacg tagtgggcca tcgccctgat 4500 
ccacgttctt taatagtgga ctcttgttcc 4560 
tctattcttt tgatttataa gggattttgg 4620 
tgatttaaca aaaatttaac gcgaattaat 4680 
aaagtcccca ggctccccag gcaggcagaa 4740 
caaccaggtg tggaaagtcc ccaggctccc 4800 
tcaattagtc agcaaccata gtcccgcccc 4860 
ccagttccgc ccattctccg ccccatggct 4920 
aggccgcctc tgcctctgag ctattccaga 4980 
gcttttgcaa aaagctcccg ggagcttgta 504 0 
gatgaggatc gtttcgcatg attgaacaag 5100 
gggtggagag gctattcggc tatgactggg 5160 
ccgtgttccg gctgtcagcg caggggcgcc 522 0 
gtgccctgaa tgaactgcag gacgaggcag 528 0 
ttccttgcgc agctgtgctc gacgttgtca 5340 
gcgaagtgcc ggggcaggat ctcctgtcat 540 0 
tcatggctga tgcaatgcgg cggctgcata 5460 
accaagcgaa acatcgcatc gagcgagcac 552 0 
aggatgatct ggacgaagag catcaggggc 5580 
aggcgcgcat gcccgacggc gaggatctcg 5640 
atatcatggt ggaaaatggc cgcttttctg 5700 
cggaccgcta tcaggacata gcgttggcta 5760 
aatgggctga ccgcttcctc gtgctttacg 5820 
ccttctatcg ccttcttgac gagttcttct 5880 
ccaagcgacg cccaacctgc catcacgaga 5940 
gttgggcttc ggaatcgttt tccgggacgc 6000 
catgctggag ttcttcgccc accccaactt 6060 
aagcaatagc atcacaaatt tcacaaataa 6120 
tttgtccaaa ctcatcaatg tatcttatca 6180 
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tgtctgtata ccgtcgacct ctagctagag cttggcgtaa tcatggtcat agctgtttcc 624 0 
tgtgtgaaat tgttatccgc tcacaattcc acacaacata cgagccggaa gcataaagtg 6300 
taaagcctgg ggtgcctaat gagtgagcta actcacatta attgcgttgc gctcactgcc 6360 
cgctttccag tcgggaaacc tgtcgtgcca gctgcattaa tgaatcggcc aacgcgcggg 6420 
gagaggcggt ttgcgtattg ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc 6480 
ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatccac 6540 
agaatcaggg gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa 6600 
ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca 6660 
caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc 6720 
gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata 6780 
cctgtccgcc tttctccctt cgggaagcgt ggcgcttfcct caatgctcac gctgtaggta 684 0 
tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca 690 0 
gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga 6960 
cttatcgcca ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg 7020 
tgctacagag ttcttgaagt ggtggcctaa ctacggctac actagaagga cagtatttgg 7080 
tatctgcgct ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg 714 0 
caaacaaacc accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag 720 0 
aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa 7260 
cgaaaactca cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat 7320 
ccttttaaat taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc 7380 
tgacagttac caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc 7440 
atccatagtt gcctgactcc ccgtcgtgta gataactacg atacgggagg gcttaccatc 750 0 
tggccccagt gctgcaatga taccgcgaga cccacgctca ccggctccag atttatcagc 7560 
aataaaccag ccagccggaa gggccgagcg cagaagtggt cctgcaactt tatccgcctc 7620 
catccagtct attaattgtt gccgggaagc tagagtaagt agttcgccag ttaatagttt 7680 
gcgcaacgtt gttgccattg ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc 7740 
ttcattcagc tccggttccc aacgatcaag gcgagttaca tgatccccca tgttgtgcaa 7800 
aaaagcggtt agctccttcg gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt 7860 
atcactcatg gttatggcag cactgcataa ttctcttact gtcatgccat ccgtaagatg 792 0 
cttttctgtg actggtgagt actcaaccaa gtcattctga gaatagtgta tgcggcgacc 7980 
gagttgctct tgcccggcgt caatacggga taataccgcg ccacatagca gaactttaaa 8040 
agtgctcatc attggaaaac gttcttcggg gcgaaaactc tcaaggatct taccgctgtt 810 0 
gagatccagt tcgatgtaac ccactcgtgc acccaactga tcttcagcat cttttacttt 8160 
caccagcgtt tctgggtgag caaaaacagg aaggcaaaat gccgcaaaaa agggaataag 822 0 
ggcgacacgg aaatgttgaa tactcatact cttccttttt caatattatt gaagcattta 8280 
tcagggttat tgtctcatga gcggatacat atttgaatgt atttagaaaa ataaacaaat 8340 
aggggttccg cgcacatttc cccgaaaagt gccacctgac gtc 83 83 

<210> 30 
<211> 7960 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid pDV67 
<400> 30 

gacggatcgg gagatctccc gatcccctat ggtcgactct cagtacaatc tgctctgatg 60 
ccgcatagtt aagccagtat ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg 120 
cgagcaaaat ttaagctaca. acaaggcaag gcttgaccga caattgcatg aagaatctgc 180 
ttagggttag gcgttttgcg ctgcttcgcg atgtacgggc cagatatacg cgttgacatt 240 
gattattgac tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata 300 
tggagttccg cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc 360 
cccgcccatt gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc 420 
attgacgtca atgggtggac tatttacggt aaactgccca cttggcagta catcaagtgt 480 
atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 540 
atgcccagta catgacctta tgggactttc ctacttggca gtacatctac gtattagtca 600 
tcgctattac catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg 660 
actcacgggg atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc 720 
aaaatcaacg ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg 780 
gtaggcgtgt acggtgggag gtctatataa gcagagctct ctggctaact agagaaccca 840 
ctgcttactg gcttatcgaa attaatacga ctcactatag ggagacccaa gctggctagc 900 
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gtttaaactt 
gggccagctg 
tgtcagtttc 

gggtggccgc 

acgacccgta 
cgcgatcggc 
gccattcggg 
tgtgcagggt 
agcagaggcg 
ccggggggtc 
tcttgcatcc 
atgggttgag 
aaatgtcgta 
caccgcggat 
gaccgaggtt 
gtgagttgga 
ccgcgtcacg 
cctgcacgtc 
cctttttttt 
ggatcggaaa 
gagtccgcat 
caagatccaa 
atgacacgga 
atgggtttca 
cctccaatgg 
accttacctc 
taaacctgga 
ccgcacctct 
tgqacgactc 
tagccctgca 
cctcaccccc 
atacacaaaa 
taaacacttt 
ctaaagttac 
gaggactaag 
atgctcaaaa 
acaacttgga 
aaaagcttga 
ccattaatgc 
tcaaaacaaa 
taggaactgg 
ataagctaac 
aagatgctaa 
cagttttggc 
ttattataag 
attggaactt 
ttatgcctaa 
tcagtcaagt 
acggtacaca 
actggtctgg 
acattgccca 
gatcagcctc 
cttccttgac 
catcgcattg 
agggggagga 
ctgaggcgga 
cattaagcgc 
tagcgcccgc 
gtcaagctct 
accccaaaaa 
tttttcgccc 
gaacaacact 



aagcttggta 
ttggggtgag 
caaaaacgag 
atccatctgg 
gagggcgttg 
gcgctccttg 
aaagacggtg 
gacaaggtca 
gccgcccttg 
tgcgtccacg 
ttgcaagtct 
tgggggaccc 
aacgtagagg 
gctggcgcgc 
gctacgggcg 
tgatatggtt 
cacgaaggag 
tagggcgcag 
ccacagctcg 
cccgtcggcc 
cgaccggatc 
gatgaagcgc 
aaccggtcct 
agagagtccc 
catgcttgcg 
ccaaaatgta 
aatatctgca 
aatggtcgcg 
caaacttagc 
aacatcaggc 
tctaactact 
tggaaaacta 
gaccgtagca 
tggagccttg 
gattgattct 
ccaactaaat 
tattaactac 
ggttaaccta 
aggagatggg 
aattggccat 
ccfctagtttt 
tttgtggacc 
actcactttg 
tgttaaaggc 
atttgacgaa 
tagaaatgga 
cctatcagct 
ttacttaaac 
ggaaacagga 
ccacaactac 
agaataaaag 
gactgtgcct 
cctggaaggt 
tctgagtagg 
ttgggaagac 
aagaaccagc 
ggcgggtgtg 
tcctttcgct 
aaatcggggc 
acttgattag 
tttgacgttg 
caaccctatc 



ccgagctcgg 
tactccctct 
gaggatttga 
tcagaaaaga 
gacagcaact 
gccgcgatgt 
gtgcgctcgt 
acgctggtgg 
cgcgagcaga 
gtaaagaccc 
agcgcctgct 
catggcatgg 
ggctctctga 
acgtaatcgt 
ggctgctctg 
ggacgctgga 
gcgtaggagt 
tagtccaggg 
cggttgagga 
tccgaacgag 
ggaaaacctc 
gcaagaccgt 
ccaactgtgc 
cctggggtac 
ctcaaaatgg 
accactgtga 
cccctcacag 
ggcaacacac 
attgccaccc 
cccctcacca 
gccactggta 
ggactaaagt 
actggtccag 
ggttttgatt 
caaaacagac 
ctaagactag 
aacaaaggcc 
agcactgcca 
cttgaatttg 
ggcctagaat 
gacagcacag 
acaccagctc 
gtcttaacaa 
agtttggctc 
aatggagtgc 
gatcttactg 
tatccaaaat 
ggagacaaaa 
gacacaactc 
attaatgaaa 
aagcggccgc 
tctagttgcc 
gccactccca 
tgtcattcta 
aatagcaggc 
tggggctcta 
gtggttacgc 
ttcttccctt 
atccctttag 
ggtgatggtt 
gagtccacgt 
tcggtctatt 



atccactctc 
gaaaagcggg 
tattcacctg 
caatcttttt 
tggcgatgga 
ttagctgcac 
cgggcaccag 
ctacctctcc 
atggcggtag 
cgggcagcag 
gccatgcgcg 
ggtgggtgag 
gtattccaag 
atagttcgtg 
ctcggaagac 
agacgttgaa 
cgcgcagctt 
tttccttgat 
caaactcttc 
atccgtactc 
tcgagaaagg 
ctgaagatac 
cttttcttac 
tctctttgcg 
gcaacggcct 
gcccacctct 
ttacctcaga 
tcaccatgca 
aaggacccct 
ccaccgatag 
gcttgggcat 
acggggctcc 
gtgtgactat 
cacaaggcaa 
gccttatact 
gacagggccc 
tttacttgtt 
aggggttgat 
gttcacctaa 
ttgattcaaa 
gtgccattac 
catctcctaa 
aatgtggcag 
caatatctgg 
tactaaacaa 
aaggcacagc 
ctcacggtaa 
ctaaacctgt 
caagtgcata 
tatttgccac 
tcgagtctag 
agccatctgt 
ctgtcctttc 
ttctgggggg 
atgctgggga 

gggggtatcc 

gcagcgtgac 
cctttctcgc 
ggttccgatt 
cacgtagtgg 
tctttaatag 
cttttgattt 



ttccgcatcg 
catgacttct 
gcccgcggtg 
gttgtcaagc 
gcgcagggtt 
gtattcgcgc 
gtgcacgcgc 
gcgtaggcgc 
ggggtctagc 
gcgcgcgtcg 
ggcggcaagc 
cgcggaggcg 
atatgtaggg 
cgagggagcg 
tatctgcctg 
gctggcgtct 
gttgaccagc 
gatgtcatac 
gcggtctttc 
cgccgccgag 
cgtctaacca 
cttcaacccc 
tcctcccttt 
cctatccgaa 
ctctctggac 
caaaaaaacc 
agccctaact 
atcacaggcc 
cacagtgtca 
cagtaccctt 
tgacttgaaa 
tttgcatgta 
taataatact 
tatgcaactt 
tgatgttagt 
tctttttata 
tacagcttca 
gtttgacgct 
tgcaccaaac 
caaggctatg 
agtaggaaac 
ctgtagacta 
tcaaatactt 
aacagttcaa 
ttccttcctg 
ctatacaaac 
aactgccaaa 
aacactaacc 
ctctatgtca 
atcctcttac 
agggcccgtt 
tgtttgcccc 
ctaataaaat 
tggggtgggg 
tgcggtgggc 
ccacgcgccc 
cgctacactt 
cacgttcgcc 
tagtgcttta 
gccatcgccc 
tggactcttg 
ataagggatt 



ctgtctgcga 
gcgctaagat 
atgcctttga 
ttggtggcaa 
tggtttttgt 
gcaacgcacc 
caaccgcggt 
tcgttggtcc 
tgcgtctcgt 
aagtagtcta 
gcgcgctcgt 
tacatgccgc 
tagcatcttc 
aggaggtcgg 
aagatggcat 
gtgagaccta 
tcggcggtga 
ttatcctgtc 
cagtactctt 
ggacctgagc 
gtcacagtcg 
gtgtatccat 
gtatccccca 
cctctagtta 
gaggccggca 
aagtcaaaca 
gtggctgccg 
ccgctaaccg 
gaaggaaagc 
actatcactg 
gagcccattt 
acagacgacc 
tccttgcaaa 
aatgtagcag 
tatccgtttg 
aactcagccc 
aacaattcca 
acagccatag 
acaaatcccc 
gttcctaaac 
aaaaataatg 
aatgcagaga 
gctacagttt 
agtgctcatc 
gacccagaat 
gctgttggat 
agtaacattg 
attacactaa 
ttttcatggg 
actttttcat 
taaacccgct 
tcccccgtgc 
gaggaaattg 
caggacagca 
tctatggctt 
tgtagcggcg 
gccagcgccc 
ggctttcccc 
cggcacctcg 
tgatagacgg 
ttccaaactg 
ttggggattt 



960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 
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cggcctattg 
gaatgtgtgt 
aaagcatgca 
gcagaagtat 
cgcccatccc 
ttttttttat 
gaggaggctt 
ttttcggatc 
ataatacgac 
caccgcgcgc 
ggacttcgtg 
cgcggtccag 
ggacgagctg 
gccggccatg 
ggccggcaac 
cgattccacc 
ctggatgatc 
tattgcagct 
atttttttca 
ctgtataccg 
gtgaaattgt 
agcctggggt 
tttccagtcg 
aggcggtttg 
cgttcggctg 
atcaggggat 
taaaaaggcc 
aaatcgacgc 
tccccctgga 
gtccgccttt 
cagttcggtg 
cgaccgctgc 
atcgccactg 
tacagagttc 
ctgcgctctg 
acaaaccacc 
aaaaggatct 
aaactcacgt 
tttaaattaa 
cagttaccaa 
catagttgcc 
ccccagtgct 
aaaccagcca 
ccagtctatt 
caacgttgtt 
attcagctcc 
agcggttagc 
actcatggtt 
ttctgtgact 
ttgctcttgc 
gctcatcatt 
atccagttcg 
cagcgtttct 
gacacggaaa 
gggttattgt 
ggttccgcgc 



gttaaaaaat 
cagttagggt 
tctcaattag 
gcaaagcatg 
gcccctaact 
ttatgcagag 
ttttggaggc 
tgatcagcac 
aaggtgagga 
gacgtcgccg 
gaggacgact 
gaccaggtgg 
tacgccgagt 
accgagatcg 
tgcgtgcact 
gccgccttct 
ctccagcgcg 
tataatggtt 
ctgcattcta 
tcgacctcta 
tatccgctca 
gcctaatgag 
ggaaacctgt 
cgtattgggc 
cggcgagcgg 
aacgcaggaa 
gcgttgctgg 
tcaagtcaga 
agctccctcg 
ctcccttcgg 
taggtcgttc 
gccttatccg 
gcagcagcca 
ttgaagtggt 
ctgaagccag 
gctggtagcg 
caagaagatc 
taagggattt 
aaatgaagtt 
tgcttaatca 
tgactccccg 
gcaatgatac 
gccggaaggg 
aattgttgcc 
gccattgcta 
ggttcccaac 
tccttcggtc 
atggcagcac 
ggtgagtact 
ccggcgtcaa 
ggaaaacgtt 
atgtaaccca 
gggtgagcaa 
tgttgaatac 
ctcatgagcg 
acatttcccc 



gagctgattt 
gtggaaagtc 
tcagcaacca 
catctcaatt 
ccgcccagtt 
gccgaggccg 
ctaggctttt 
gtgttgacaa 
actaaaccat 
gagcggtcga 
tcgccggtgt 
tgccggacaa 
ggtcggaggt 
gcgagcagcc 
tcgtggccga 
atgaaaggtt 
gggatctcat 
acaaataaag 
gttgtggttt 
gctagagctt 
caattccaca 
tgagctaact 
cgtgccagct 
gctcttccgc 
tatcagctca 
agaacatgtg 
cgtttttcca 
ggtggcgaaa 
tgcgctctcc 
gaagcgtggc 
gctccaagct 
gtaactatcg 
ctggtaacag 
ggcctaacta 
ttaccttcgg 
gtggtttttt 
ctttgatctt 
tggtcatgag 
ttaaatcaat 
gtgaggcacc 
tcgtgtagat 
cgcgagaccc 
ccgagcgcag 
gggaagctag 
caggcatcgt 
gatcaaggcg 
ctccgatcgt 
tgcataattc 
caaccaagtc 
tacgggataa 
cttcggggcg 
ctcgtgcacc 
aaacaggaag 
tcatactctt 
gatacatatt 
gaaaagtgcc 



aacaaaaatt 
cccaggctcc 
ggtgtggaaa 
agtcagcaac 
ccgcccattc 
cctctgcctc 
gcaaaaagct 
ttaatcatcg 
ggccaagttg 
gttctggacc 
ggtccgggac 
caccctggcc 
cgtgtccacg 
gtgggggcgg 
gga.gcaggac 
gggcttcgga 
gctggagttc 
caatagcatc 
gtccaaactc 
ggcgtaatca 
caacatacga 
cacattaatt 
gcattaatga 
ttcctcgctc 
ctcaaaggcg 
agcaaaaggc 
taggctccgc 
cccgacagga 
tgttccgacc 
gctttctcaa 

gggctgtgtg 

tcttgagtcc 
gattagcaga 
cggctacact 
aaaaagagtt 
tgtttgcaag 
ttctacgggg 
attatcaaaa 
ctaaagtata 
tatctcagcg 
aactacgata 
acgctcaccg 
aagtggtcct 
agtaagtagt 
ggtgtcacgc 
agttacatga 
tgtcagaagt 
tcttactgtc 
attctgagaa 
taccgcgcca 
aaaactctca 
caactgatct 
gcaaaatgcc 
cctttttcaa 
tgaatgtatt 
acctgacgtc 



taacgcgaat 
ccaggcaggc 
gtccccaggc 
catagtcccg 
tccgccccat 
tgagctattc 
cccgggagct 
gcatagtata 
accagtgccg 
gaccggctcg 
gacgtgaccc 
tgggtgtggg 
aacttccggg 
gagttcgccc 
tgacacgtgc 
atcgttttcc 
ttcgcccacc 
acaaatttca 
atcaatgtat 
tggtcatagc 
gccggaagca 
gcgttgcgct 
atcggccaac 
actgactcgc 
gtaatacggt 
cagcaaaagg 
ccccctgacg 
ctataaagat 
ctgccgctta 
tgctcacgct 
cacgaacccc 
aacccggtaa 
gcgaggtatg 
agaaggacag 
ggtagctctt 
cagcagatta 
tctgacgctc 
aggatcttca 
tatgagtaaa 
atctgtctat 
cgggagggct 
gctccagatt 
gcaactttat 
tcgccagtta 
tcgtcgtttg 
tcccccatgt 
aagttggccg 
atgccatccg 
tagtgtatgc 
catagcagaa 
aggatcttac 
tcagcatctt 
gcaaaaaagg 
tattattgaa 
tagaaaaata 



taattctgtg 
agaagtatgc 
tccccagcag 
cccctaactc 
ggctgactaa 
cagaagtagt 
tgtatatcca 
tcggcatagt 
ttccggtgct 
ggttctcccg 
tgttcatcag 
tgcgcggcct 
acgcctccgg 
tgcgcgaccc 
tacgagattt 
gggacgccgg 
ccaacttgtt 
caaataaagc 
cttatcatgt 
tgtttcctgt 
taaagtgtaa 
cactgcccgc 
gcgcggggag 
tgcgctcggt 
tatccacaga 
ccaggaaccg 
agcatcacaa 
accaggcgtt 
ccggatacct 
gtaggtatct 
ccgttcagcc 
gacacgactt 
taggcggtgc 
tatttggtat 
gatccggcaa 
cgcgcagaaa 
agtggaacga 
cctagatcct 
cttggtctga 
ttcgttcatc 
taccatctgg 
tatcagcaat 
ccgcctccat 
atagtttgcg 
gtatggcttc 
tgtgcaaaaa 
cagtgttatc 
taagatgctt 
ggcgaccgag 
ctttaaaagt 
cgctgttgag 
ttactttcac 
gaataagggc 
gcatttatca 
aacaaatagg 



4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7960 



<210> 31 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: primer 
<400> 31 

atgggatcca agatgaagcg cgcaagaccg 30 

<210> 32 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 32 

cactatagcg gccgcattct cagtcatctt 3 0 

<210> 33 
<211> 7989 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid pDV69 
<400> 33 

gacggatcgg gagatctccc gatcccctat ggtcgactct cagtacaatc tgctctgatg 60 
ccgcatagtt aagccagtat ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg 120 
cgagcaaaat ttaagctaca acaaggcaag gcttgaccga caattgcatg aagaatctgc 180 
ttagggttag gcgttttgcg ctgcttcgcg atgtacgggc cagatatacg cgttgacatt 240 
gattattgac tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata 3 00 
tggagttccg cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc 360 
cccgcccatt gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc 420 
attgacgtca atgggtggac tatttacggt aaactgccca cttggcagta catcaagtgt 4 80 
atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 54 0 
atgcccagta catgacctta tgggactttc ctacttggca gtacatctac gtattagtca 600 
tcgctattac catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg 660 
actcacgggg atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc 72 0 
aaaatcaacg ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg 7 80 
gtaggcgtgt acggtgggag gtctatataa gcagagctct ctggctaact agagaaccca 84 0 
ctgcttactg gcttatcgaa attaatacga ctcactatag ggagacccaa gctggctagc 900 
gtttaaactt aagcttggta ccgagctcgg atccactctc ttccgcatcg ctgtctgcga 960 
gggccagctg ttggggtgag tactccctct gaaaagcggg catgacttct gcgctaagat 1020 
tgtcagtttc caaaaacgag gaggatttga tattcacctg gcccgcggtg atgcctttga 1080 
gggtggccgc atccatctgg tcagaaaaga caatcttttt gttgtcaagc ttggtggcaa 1140 
acgacccgta gagggcgttg gacagcaact tggcgatgga gcgcagggtt tggtttttgt 1200 
cgcgatcggc gcgctccttg gccgcgatgt ttagctgcac gtattcgcgc gcaacgcacc 1260 
gccattcggg aaagacggtg gtgcgctcgt cgggcaccag gtgcacgcgc caaccgcggt 1320 
tgtgcagggt gacaaggtca acgctggtgg ctacctctcc gcgtaggcgc tcgttggtcc 1380 
agcagaggcg gccgcccttg cgcgagcaga atggcggtag ggggtctagc tgcgtctcgt 1440 
ccggggggtc tgcgtccacg gtaaagaccc cgggcagcag gcgcgcgtcg aagtagtcta 1500 
tcttgcatcc ttgcaagtct agcgcctgct gccatgcgcg ggcggcaagc gcgcgctcgt 1560 
atgggttgag tgggggaccc catggcatgg ggtgggtgag cgcggaggcg tacatgccgc 1620 
aaatgtcgta aacgtagagg ggctctctga gtattccaag atatgtaggg tagcatcttc 1680 
caccgcggat gctggcgcgc acgtaatcgt atagttcgtg cgagggagcg aggaggtcgg 1740 
gaccgaggtt gctacgggcg ggctgctctg ctcggaagac tatctgcctg aagatggcat 1800 
gtgagttgga tgatatggtt ggacgctgga agacgttgaa gctggcgtct gtgagaccta 1860 
ccgcgtcacg cacgaaggag gcgtaggagt cgcgcagctt gttgaccagc tcggcggtga 1920 
cctgcacgtc tagggcgcag tagtccaggg tttccttgat gatgtcatac ttatcctgtc 1980 
cctttttttt ccacagctcg cggttgagga caaactcttc gcggtctttc cagtactctt 2040 
ggatcggaaa cccgtcggcc tccgaacgag atccgtactc cgccgccgag ggacctgagc 2100 
gagtccgcat cgaccggatc ggaaaacctc tcgagaaagg cgtctaacca gtcacagtcg 2160 
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caagatccaa gatgaagcgc gcaagaccgt 
atgacacgga aaccggtcct ccaactgtgc 
atgggtttca agagagtccc cctggggtac 
cctccaatgg catgcttgcg ctcaaaatgg 
accttacctc ccaaaatgta accactgtga 
taaacctgga aatatctgca cccctcacag 
ccgcacctct aatggtcgcg ggcaacacac 
tgcacgactc caaacttagc attgccaccc 
tagccctgca aacatcaggc cccctcacca 
cctcaccccc tctaactact gccactggta 
atacacaaaa tggaaaacfca ggactaaagt 
taaacacttt gaccgtagca actggtccag 
ctaaagttac tggagccttg ggttttgatt 
gaggactaag gattgattct caaaacagac 
atgctcaaaa ccaactaaat ctaagactag 
acaacttgga tattaactac aacaaaggcc 
aaaagcttga ggttaaccta agcactgcca 
ccattaatgc aggagatggg cttgaatttg 
tcaaaacaaa aattggccat ggcctagaat 
taggaactgg ccttagtttt gacagcacag 
ataagctaac tttgtggacc ggtccaaaac 
aacaaaaccc agatagcaaa ctaactttaa 
gatatgtaac gctaatggga gcctcagact 
tctccattaa tgtagaacta tactttgatg 
ctcttaaaac agatctagaa ctaaaataca 
ttatgccaag tactacagcg tatccatttg 
attatatttt tggtcaatgc tactacaaag 
ttactgttat gcttaataaa cgcctgccag 
tatggtcctt gaatgctggt ctagctccag 
catttacctt ttcctatatt agagaagatg 
tagagggccc gtttaaaccc gctgatcagc 
tgttgtttgc ccctcccccg tgccttcctt 
ttcctaataa aatgaggaaa ttgcatcgca 
gggtggggtg gggcaggaca gcaaggggga 
ggatgcggtg ggctctatgg cttctgaggc 
ggggtatccc cacgcgccct gtagcggcgc 
cagcgtgacc gctacacttg ccagcgccct 
ctttctcgcc acgttcgccg gctttccccg 
gttccgattt agtgctttac ggcacctcga 
acgtagtggg ccatcgccct gatagacggt 
ctttaatagt ggactcttgt tccaaactgg 
ttttgattta taagggattt tggggatttc 
acaaaaattt aacgcgaatt aattctgtgg 
ccaggctccc caggcaggca gaagtatgca 
gtgtggaaag tccccaggct ccccagcagg 
gtcagcaacc atagtcccgc ccctaactcc 
cgcccattct ccgccccatg gctgactaat 
ctctgcctct gagctattcc agaagtagtg 
caaaaagctc ccgggagctt gtatatccat 
taatcatcgg catagtatat cggcatagta 
gccaagttga ccagtgccgt tccggtgctc 
ttctggaccg accggctcgg gttctcccgg 
gtccgggacg acgtgaccct gttcatcagc 
accctggcct gggtgtgggt gcgcggcctg 
gtgtccacga acttccggga cgcctccggg 
tgggggcggg agttcgccct gcgcgacccg 
gagcaggact gacacgtgct acgagatttc 
ggcttcggaa tcgttttccg ggacgccggc 
ctggagttct tcgcccaccc caacttgttt 
aatagcatca caaatttcac aaataaagca 
tccaaactca tcaatgtatc ttatcatgtc 
gcgtaatcat ggtcatagct gtttcctgtg 



ctgaagatac cttcaacccc gtgtatccat 2220 
cttttcttac tcctcccttt gtatccccca 2280 
tctctttgcg cctatccgaa cctctagtta 2340 
gcaacggcct ctctctggac gaggccggca 240 0 
gcccacctct caaaaaaacc aagtcaaaca 2460 
ttacctcaga agccctaact gtggctgccg 2520 
tcaccatgca atcacaggcc ccgctaaccg 2580 
aaggacccct cacagtgtca gaaggaaagc 2 64 0 
ccaccgatag cagtaccctt actatcactg 2700 
gcttgggcat tgacttgaaa gagcccattt 2760 
acggggctcc tttgcatgta acagacgacc 2 82 0 
gtgtgactat taataatact tccttgcaaa 2 880 
cacaaggcaa tatgcaactt aatgtagcag 2 940 
gccttatact tgatgttagt tatccgtttg 3000 
gacagggccc tctttttata aactcagccc 3 060 
tttacttgtt tacagcttca aacaattcca 3120 
aggggttgat gtttgacgct acagccatag 3180 
gttcacctaa tgcaccaaac acaaatcccc 3240 
ttgattcaaa caaggctatg gttcctaaac 3300 
gtgccattac agtaggaaac aaaaataatg 3360 
cagaagccaa ctgcataatt gaatacggga 3420 
tccttgtaaa aaatggagga attgttaatg 3480 
acgttaacac cttatttaaa aacaaaaatg 3 540 
ccactggtca tatattacca gactcatctt 3600 
agcaaaccgc tgactttagt gcaagaggtt 3 660 
tccttcctaa tgcgggaaca cataatgaaa 3 72 0 
caagcgatgg tgcccttttt ccgttggaag 3 780 
atagtcgcac atcctatgtt atgacttttt 3 840 
aaactactca ggcaaccctc ataacctccc 3 900 
actgattttt aagaagcggc cgctcgagtc 3 960 
ctcgactgtg ccttctagtt gccagccatc 4 020 
gaccctggaa ggtgccactc ccactgtcct 4080 
ttgtctgagt aggtgtcatt ctattctggg 4140 
ggattgggaa gacaatagca ggcatgctgg 42 0 0 
ggaaagaacc snccntagct ggggctctag 42 60 
attaagcgcg gcgggtgtgg tggttacgcg 432 0 
agcgcccgct cctttcgctt tcttcccttc 4380 
tcaagctcta aatcggggca tccctttagg 4440 
ccccaaaaaa cttgattagg gtgatggttc 450 0 
ttttcgccct ttgacgttgg agtccacgtt 4560 
aacaacactc aaccctatct cggtctattc 4620 
ggcctattgg ttaaaaaatg agctgattta 4680 
aatgtgtgtc agttagggtg tggaaagtcc 4740 
aagcatgcat ctcaattagt cagcaaccag 4800 
cagaagtatg caaagcatgc atctcaatta 4 860 
gcccatcccg cccctaactc cgcccagttc 4920 
tttttttatt tatgcagagg ccgaggccgc 4980 
aggaggcttt tttggaggcc taggcttttg 5040 
tttcggatct gatcagcacg tgttgacaat 5100 
taatacgaca aggtgaggaa ctaaaccatg 5160 
accgcgcgcg acgtcgccgg agcggtcgag 5220 
gacttcgtgg aggacgactt cgccggtgtg 5280 
gcggtccagg accaggtggt gccggacaac 5340 
gacgagctgt acgccgagtg gtcggaggtc 5400 
ccggccatga ccgagatcgg cgagcagccg 5460 
gccggcaact gcgtgcactt cgtggccgag 5520 
gattccaccg ccgccttcta tgaaaggttg 5580 
tggatgatcc tccagcgcgg ggatctcatg 5640 
attgcagctt ataatggtta caaataaagc 5700 
tttttttcac tgcattctag ttgtggtttg 5760 
tgtataccgt cgacctctag ctagagcttg 5820 
tgaaattgtt atccgctcac aattccacac 5880 
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aacatacgag 
acattaattg 
cattaatgaa 
tcctcgctca 
tcaaaggcgg 
gcaaaaggcc 
aggctccgcc 
ccgacaggac 
gttccgaccc 
ctttctcaat 
ggctgtgtgc 
cttgagtcca 
attagcagag 
ggctacacta 
aaaagagttg 
gtttgcaagc 
tctacggggt 
ttatcaaaaa 
taaagtatat 
atctcagcga 
actacgatac 
cgctcaccgg 
agtggtcctg 
gtaagtagtt 
gtgtcacgct 
gttacatgat 
gtcagaagta 
cttactgtca 
ttctgagaat 
accgcgccac 
aaactctcaa 
aactgatctt 
caaaatgccg 
ctttttcaat 
gaatgtattt 
cctgacgtc 



ccggaagcat 
cgttgcgctc 
tcggccaacg 
ctgactcgct 
taatacggtt 
agcaaaaggc 
cccctgacga 
tataaagata 
tgccgcttac 
gctcacgctg 
acgaaccccc 
acccggtaag 
cgaggtatgt 
gaaggacagt 
gtagctcttg 
agcagattac 
ctgacgctca 
ggatcttcac 
atgagtaaac 
tctgtctatt 
gggagggctt 
ctccagattt 
caactttatc 
cgccagttaa 
cgtcgtttgg 
cccccatgtt 
agttggccgc 
tgccatccgt 
agtgtatgcg 
atagcagaac 
ggatcttacc 
cagcatcttt 
caaaaaaggg 
attattgaag 
agaaaaataa 



aaagtgtaaa 
actgcccgct 
cgcggggaga 
gcgctcggtc 
atccacagaa 
caggaaccgt 
gcatcacaaa 
ccaggcgttt 
cggatacctg 
taggtatctc 
cgttcagccc 
acacgactta 
aggcggtgct 
atttggtatc 
atccggcaaa 
gcgcagaaaa 
gtggaacgaa 
ctagatcctt 
ttggtctgac 
tcgttcatcc 
accatctggc 
atcagcaata 
cgcctccatc 
tagtttgcgc 
tatggcttca 
gtgcaaaaaa 
agtgttatca 
aagatgcttt 
gcgaccgagt 
tttaaaagtg 
gctgttgaga 
tactttcacc 
aataagggcg 
catttatcag 
acaaataggg 



gcctggggtg 
ttccagtcgg 
ggcggtttgc 
gttcggctgc 
tcaggggata 
aaaaaggccg 
aatcgacgct 
ccccctggaa 
tccgcctttc 
agttcggtgt 
gaccgctgcg 
tcgccactgg 
acagagttct 
tgcgctctgc 
caaaccaccg 
aaaggatctc 
aactcacgtt 
ttaaattaaa 
agttaccaat 
atagttgcct 
cccagtgctg 
aaccagccag 
cagtctatta 
aacgttgttg 
ttcagctccg 
gcggttagct 
ctcatggtta 
tctgtgactg 
tgctcttgcc 
ctcatcattg 
tccagttcga 
agcgtttctg 
acacggaaat 
ggttattgtc 
gttccgcgca 



cctaatgagt 
gaaacctgtc 
gtattgggcg 
ggcgagcggt 
acgcaggaaa 
cgttgctggc 
caagtcagag 
gctccctcgt 
tcccttcggg 
aggtcgttcg 
ccttatccgg 
cagcagccac 
tgaagtggtg 
tgaagccagt 
ctggtagcgg 
aagaagatcc 
aagggatttt 
aatgaagttt 
gcttaatcag 
gactccccgt 
caatgatacc 
ccggaagggc 
attgttgccg 
ccattgctac 
gttcccaacg 
ccttcggtcc 
tggcagcact 
gtgagtactc 
cggcgtcaat 
gaaaacgttc 
tgtaacccac 
ggtgagcaaa 
gttgaatact 
tcatgagcgg 
catttccccg 



gagctaactc 
gtgccagctg 
ctcttccgct 
atcagctcac 
gaacatgtga 
gtttttccat 
gtggcgaaac 
gcgctctcct 
aagcgtggcg 
ctccaagctg 
taactatcgt 
tggtaacagg 
gcctaactac 
taccttcgga 
tggttttttt 
tttgatcttt 
ggt cat gaga 
taaatcaatc 
tgaggcacct 
cgtgtagata 
gcgagaccca 
cgagcgcaga 
ggaagctaga 
aggcatcgtg 
atcaaggcga 
tccgatcgtt 
gcataattct 
aaccaagtca 
acgggataat 
ttcggggcga 
tcgtgcaccc 
aacaggaagg 
catactcttc 
atacatattt 
aaaagtgcca 



<210> 34 

<211> 7607 

<212> DNA 

<213> Artificial Sequence 
<220> 



5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
7989 



<223> Description of Artificial Sequence: plasmid GRE5-El-SV40-Hygro 



<400> 34 

tctagaagat 

ttctagctac 

gtacaggatg 

agatcgatct 

gcgcgttcgt 

aacttcaggg 

atgttatatg 

tatcaccatg 

ttgtctcctc 

ttgtaacgaa 

tcactttttt 

acaattgtta 

ggaaatattc 

tggttacaat 

ccctctgcta 

gttattgtgc 



ccgctgtaca 
tttattagat 
ttctagctac 
cctggccgtt 
cctcactctc 
tgagtttggg 
gagggggcaa 
gaccctcatg 
ttattttctt 
tttttaaatt 
ttcaaggcaa 
taattaaatg 
ttattggtag 
gatatacact 
accatgttca 
tgtctcatca 



ggatgttcta 
ccgctgtaca 
tttattagat 
cggggtcaaa 
ttccgcatcg 
gacccttgat 
agttttcagg 
ataattttgt 
ttcattttct 
cacttttgtt 
tcagggtata 
ataaggtaga 
aaacaactac 
gtttgagatg 
tgccttcttc 
ttttggcaaa 



gctactttat 
ggatgttcta 
ccgtgtacag 
aaccaggttt 
ctgtctgcga 
tgttctttct 
gtgttgttta 
ttctttcact 
gtaacttttt 
tatttgtcag 
ttatattgta 
atatttctgc 
atcctggtca 
aggataaaat 
tttttcctac 
gaattagatc 



tagatccgct 
gctactttat 
gatgttctag 
ggctataaaa 
gggccaggat 
ttttcgctat 
gaatgggaag 
ttctactctg 
cgttaaactt 
attgtaagta 
cttcagcaca 
atataaattc 
tcatcctgcc 
actctgagtc 
agctcctggg 
taagcttctg 



gtacaggatg 60 
tagatccgct 120 
ctactttatt 180 
gggggtgggg 240 

cgatcctgag 300 
tgtaaaattc 360 
atgtcccttg 420 
ttgacaacca 480 
tagcttgcat 540 
ctttctctaa 600 
gttttagaga 660 
tggctggcgt 720 
tttctcttta 780 
caaaccgggc 840 
caacgtgctg 900 
cagctcgagg 960 
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actcggtcga 
ggccgccagt 
tagccatttt 
agatcccaac 
ggaagggatt 
ttcccggcag 
tgtaccggag 
ggatgaagag 
gtcttgtcat 
tatgaggacc 
agtggtgggt 
ttttgtattg 
gaaccggagc 
ccgacatcac 
tctaacacac 
gtgagagttg 
gggcaacctt 
cgtgtgtggt 
ataatgttta 
cgtgggctaa 
tctgctgtgc 
ctgtggggct 

tttgaagagc 
gcgcttttcc 
gctgctgttg 

ggggggtacc 

aatcgcctgc 
cagcagcagg 
ggcctggacc 
gcattttgac 
cttgtgaggc 
ctgagtgtat 
cgcagaagta 
aggaggctat 
gcaaacttgt 
tagatacgga 
ttggcatgga 
cggttttcct 
atacctgtgt 
ggaagggggt 
ggtgtacctt 
actgtggttg 
gcaactgcga 
tgaagaccat 
tactgacccg 
gcaatttgag 
acggggtgtt 
ccaggtgcag 
atgtgaccga 
gctctagcga 
gaaagaatat 
ccgccatgag 
tgcccccatg 
tcctgcccgc 
ctgcagcctc 
ttgctttcct 
agttgacggc 
agcagctgtt 
cggtttaaaa 
gctgtctcag 
ttaaagcaga 
aatttcacaa 



ctgaaaatga 
cttttggacc 
gaaccaccta 
gaggaggcgg 
gacttactca 
cccgagcagc 
gtgatcgatc 
ggtgaggagt 
tatcaccgga 
tgtggcatgt 
ttggtgtggt 
tgattttttt 
ctgcaagacc 
ctgtgtctag 
ctcctgagat 
gtgggcgtcg 
tggacttgag 
taacgccttt 
acttgcatgg 
tcttggttac 
gtaacttgct 
catcccaggc 
ttttgaaatc 
aagagaaggt 
cttttttgag 
tgctggattt 
tactgttgtc 
aggaagccag 
ctcgggaatg 
aattacagag 
tacagaggag 
tacttttcaa 
ttccatagag 
tagggtatat 
aaatatcagg 
ggatagggtg 
cggggtggtt 
ggccaatacc 
ggaagcctgg 

ggtgtgtcgc 
gggtatcctg 
cttcatgcta 
ggacagggcc 
tcacgtagcc 
ctgttccttg 
tcacactaag 
tgacatgacc 
accctgcgag 
ggagctgagg 
tgaagataca 
ataaggtggg 
caccaactcg 
ggccggggtg 
aaactctact 
cgccgccgct 
gagcccgctt 
tcttttggca 
ggatctgcgc 
cataaataaa 
ctgactgctt 
acttgtttat 
ataaagcatt 



gacatattat 
agctgatcga 
cccttcacga 
tttcgcagat 
cttttccgcc 
cggagcagag 
ttacctgcca 
ttgtgttaga 
ggaatacggg 
ttgtctacag 
aatttttttt 
aaaaggtcct 
tacccgccgt 
agaatgcaat 
acacccggtg 
ccaggctgtg 
ctgtaaacgc 
gtttgctgaa 
cgtgttaaat 
atctgacctc 
ggaacagagc 
aaagttagtc 
ctgtggtgag 
catcaagact 
ttttataaag 
tctggccatg 
ttccgtccgc 
gcggcggcgg 
aatgttgtac 
gatgggcagg 
gctaggaatc 
cagatcaagg 
cagctgacca 
gcaaaggtgg 
aattgttgct 
gcctttagat 
attatgaatg 
aaccttatcc 
accgatgtaa 
cccaaaagca 
tctgagggta 
gtgaaaagcg 
tctcagatgc 
agccactctc 
catttgggta 
atattgcttg 
atgaagatct 
tgtggcggta 
cccgatcact 
gattgaggta 
ggtcttatgt 
tttgatggaa 
cgtcagaatg 
accttgacct 
tcagccgctg 
gcaagcagtg 
caattggatt 
cagcaggttt 
aaaccagact 
aagtcgcaag 
tgcagcttat 
tttttcactg 



ctgccacgga 
agaggtactg 
actgtatgat 
ttttcccgac 
ggcgcccggt 
agccttgggt 
cgaggctggc 
ttafcgtggag 
ggacccagat 
taagtgaaaa 
ttaattttta 
gtgtctgaac 
cctaaaatgg 
agtagtacgg 
gtcccgctgt 
gaatgtatcg 
cccaggccat 
tgagttgatg 
ggggcggggc 
atggaggctt 
tctaacagta 
tgcagaatta 
ctgtttgatt 
ttggattttt 
gataaatgga 
catctgtgga 
ccggcgataa 
caggagcaga 
aggtggctga 
ggctaaaggg 
tagcttttag 
ataattgcgc 
cttactggct 
cacttaggcc 
acatttctgg 
gtagcatgat 
taaggtttac 
tacacggtgt 
gggttcgggg 
gggcttcaat 
actccagggt 
tggctgtgat 
tgacctgctc 
gcaaggcctg 
acaggagggg 
agcccgagag 
ggaaggtgct 
aacatattag 
tggtgctggc 
ctgaaatgtg 
agttttgtat 
gcattgtgag 
tgatgggctc 
acgagaccgt 
cagccaccgc 
cagcttcccg 
ctttgacccg 
ctgccctgaa 
ctgtttggat 
ccgaattgga 
aatggttaca 
cattctagtt 



ggtgttatta 
gctgataatc 
ttagacgtga 
tctgtaatgt 
tctccggagc 
ccggtfctcta 
tttccaccca 
caccccgggc 
attatgtgtt 
ttatgggcag 
cagttttgtg 
ctgagcctga 
cgcctgctat 
atagctgtga 
gccccattaa 
aggacttgct 
aaggtgtaaa 
taagtttaat 
ttaaagggta 

gggagtgttt 
cctcttggtt 
aggaggatta 
ctttgaatct 
ccacaccggg 
gcgaagaaac 
gagcggttgt 
taccgacgga 
gcccatggaa 
actgtatcca 
ggtaaagagg 
cttaatgacc 
taatgagctt 
gcagccaggg 
agattgcaag 
gaacggggcc 
aaatatgtgg 
tggccccaat 
aagcttctat 
ctgtgccttt 
taagaaatgc 
gcgccacaat 
taagcataac 
ggacggcaac 
gccagtgttt 
ggtgttccta 
catgtccaag 
gaggtacgat 
gaaccagcct 
ctgcacccgc 
tgggcgtggc 
ctgttttgca 
ctcatatttg 
cagcattgat 
gtctggaacg 
ccgcgggatt 
ttcatccgcc 
ggaacttaat 
ggcttcctcc 
ttggatcaag 
tccaattcgg 
aataaagcaa 
gtggtttgtc 



ccgaagaaat 
ttccacctcc 
cggcccccga 
tggcggtgca 
cgcctcacct 
tgccaaacct 
gtgacgacga 
acggttgcag 
cgctttgcta 
tgggtgatag 
gtttaaagaa 
gcccgagcca 
cctgagacgc 
ctccggtcct 
accagttgcc 
taacgagcct 
cctgtgattg 
aaagggtgag 
tataatgcgc 
ggaagatttt 
ttggaggttt 
caagtgggaa 
gggtcaccag 
gcgcgctgcg 
ccatctgagc 
gagacacaag 
ggagcagcag 
cccgagagcc 
gaactgagac 
gagcgggggg 
agacaccgtc 
gatctgctgg 
gatgattttg 
tacaagatca 
gaggtggaga 

ccgggggtgc 

tttagcggta 
gggtttaaca 
tactgctgct 
ctctttgaaa 
gtggcctccg 
atggtatgtg 
tgtcacctgc 
gagcataaca 
ccttaccaat 
gtgaacctga 
gagacccgca 
gtgatgctgg 
gctgagtttg 
ttaagggtgg 
gcagccgccg 
acaacgcgca 
ggtcgccccg 
ccgttggaga 
gtgactgact 
cgcgatgaca 
gtcgtttctc 
cctcccaatg 
caagtgtctt 
atcgatctta 
tagcatcaca 
caaactcatc 



1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
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aatgtatctt atcatgtctg 
ctgcgctcgg tcgttcggct 
ttatccacag aatcagggga 
gccaggaacc gtaaaaaggc 
gagcatcaca aaaatcgacg 
taccaggcgt ttccccctgg 
accggatacc tgtccgcctt 
tgtaggtatc tcagttcggt 
cccgttcagc ccgaccgctg 
agacacgact tatcgccact 
gtaggcggtg ctacagagtt 
gtatttggta tctgcgctct 
tgatccggca aacaaaccac 
acgcgcagaa aaaaaggatc 
cagtggaacg aaaactcacg 
acctagatcc ttttaaatta 
acttggtctg acagttacca 
tttcgttcat ccatagttgc 
ttaccatctg gccccagtgc 
ttatcagcaa taaaccagcc 
tccgcctcca tccagtctat 
aatagtttgc gcaacgttgt 
ggtatggctt cattcagctc 
ttgtgcaaaa aagcggttag 
gcagtgttat cactcatggt 
gtaagatgct tttctgtgac 
cggcgaccga gttgctcttg 
actttaaaag tgctcatcat 
ccgctgttga gatccagttc 
tttactttca ccagcgtttc 
ggaataaggg cgacacggaa 
agcatttatc agggttattg 
aaacaaatag gggttccgcg 
attattatca tgacattaac 
gcgtttcggt gatgacggtg 
ttgtctgtaa gcggatgccg 
cgggtgtcgg ggctggctta 
tatgcggtgt gaaataccgc 
agcgttaata ttttgttaaa 
caataggccg aaatcggcaa 
agtgttgttc cagtttggaa 
gggcgaaaaa ccgtctatca 
tttttggggt cgaggtgccg 
agagcttgac ggggaaagcc 
gcgggcgcta gggcgctggc 
gcgcttaatg cgccgctaca 
gaagggcgat cggtgcgggc 
gcaaggcgat taagttgggt 
gccagtgaat tgtaatacga 



gtcgactcta gactcttccg 
gcggcgagcg gtatcagctc 
taacgcagga aagaacatgt 
cgcgttgctg gcgtttttcc 
ctcaagtcag aggtggcgaa 
aagctccctc gtgcgctctc 
tctcccttcg ggaagcgtgg 
gtaggtcgtt cgctccaagc 
cgccttatcc ggtaactatc 
ggcagcagcc actggtaaca 
cttgaagtgg tggcctaact 
gctgaagcca gttaccttcg 
cgctggtagc ggtggttttt 
tcaagaagat cctttgatct 
ttaagggatt ttggtcatga 
aaaatgaagt tttaaatcaa 
atgcttaatc agtgaggcac 
ctgactcccc gtcgtgtaga 
tgcaatgata ccgcgagacc 
agccggaagg gccgagcgca 
taattgttgc cgggaagcta 
tgccattgct acaggcatcg 
cggttcccaa cgatcaaggc 
ctccttcggt cctccgatcg 
tatggcagca ctgcataatt 
tggtgagtac tcaaccaagt 
cccggcgtca atacgggata 
tggaaaacgt tcttcggggc 
gatgtaaccc actcgtgcac 
tgggtgagca aaaacaggaa 
atgttgaata ctcatactct 
tctcatgagc ggatacatat 
cacatttccc cgaaaagtgc 
ctataaaaat aggcgtatca 
aaaacctctg acacatgcag 
ggagcagaca agcccgtcag 
actatgcggc atcagagcag 
acagatgcgt aaggagaaaa 
attcgcgtta aatttttgtt 
aatcccttat aaatcaaaag 
caagagtcca ctattaaaga 
gggcgatggc ccactacgtg 
taaagcacta aatcggaacc 
ggcgaacgtg . gcgagaaagg 
aagtgtagcg gtcacgctgc 
gggcgcgtcc cattcgccat 
ctcttcgcta ttacgccagc 
aacgccaggg ttttcccagt 
ctcactatag ggcgaattaa 



cttcctcgct cactgactcg 4740 
actcaaaggc ggtaatacgg 4800 
gagcaaaagg ccagcaaaag 4 860 
ataggctccg cccccctgac 4920 
acccgacagg actataaaga 4980 
ctgttccgac cctgccgctt 5040 
cgctttctca tagctcacgc 5100 
tgggctgtgt gcacgaaccc 5160 
gtcttgagtc caacccggta 5220 
ggattagcag agcgaggtat 52 80 
acggctacac tagaaggaca 5340 
gaaaaagagt tggtagctct 54 00 
ttgtttgcaa gcagcagatt 5460 
tttctacggg gtctgacgct 5520 
gattatcaaa aaggatcttc 5580 
tctaaagtat atatgagtaa 5640 
ctatctcagc gatctgtcta 5700 
taactacgat acgggagggc 5760 
cacgctcacc ggctccagat 5820 
gaagtggtcc tgcaacttta 5880 
gagtaagtag ttcgccagtt 5940 
tggtgtcacg ctcgtcgttt 6000 
gagttacatg atcccccatg 6060 
ttgtcagaag taagttggcc 6120 
ctcttactgt catgccatcc 6180 
cattctgaga atagtgtatg 6240 
ataccgcgcc acatagcaga 63 00 
gaaaactctc aaggatctta 6360 
ccaactgatc ttcagcatct 6420 
ggcaaaatgc cgcaaaaaag 64 80 
tcctttttca atattattga 6540 
ttgaatgtat ttagaaaaat 6600 
cacctgacgt ctaagaaacc 6660 
cgaggcccct ttcgtctcgc 6720 
ctcccggaga cggtcacagc 6780 
ggcgcgtcag cgggtgttgg 6840 
attgtactga gagtgcacca 6900 
taccgcatca ggaaattgta 6960 
aaatcagctc attttttaac 7020 
aatagaccga gatagggttg 7080 
acgtggactc caacgtcaaa 7140 
aaccatcacc ctaatcaagt 7200 
ctaaagggag cccccgattt 7260 
aagggaagaa agcgaaagga 7320 
gcgtaaccac cacacccgcc 73 80 
tcaggctgcg caactgttgg 7440 
tggcgaaagg gggatgtgct 7500 
cacgacgttg taaaacgacg 7560 
ttcgggg 7607 



<210> 35 
<211> 11600 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid MMTV~E2a-SV40-Neo 
<400> 35 

gaattccgca ttgcagagat attgtattta agtgcctagc tcgatacaat aaacgccatt 60 
tgaccattca ccacattggt gtgcacctcc aagcttgggc agaaatggtt gaactcccga 120 
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gagtgtccta cacctagggg agaagcagcc 
tctgcgcaca aacggatgag cccatcagac 
ggcatagctc tgctttgcct ggggctattg 
ctcacccttg actcttttaa tagctcttct 
aactcgacct tcctcctgag gcaaggacca 
ttttgtcctt cagaaataga aataagaatg 
accaatccaa taggtagatt attagttact 
actattttta ctcaaattca gaagttagaa 
acctcaattg aagaacaggt gcaaggacta 
aaaaagagtg tttttgtcaa aataggagac 
ccttacatct acagaccaac agatgccccc 
tgggataggt gggttacagt caatggctat 
gaaagactcg ccagagctag acctccttgg 
atgaaacaac aggtacatga ttatatttat 
attttccata ccaaggaggg gacagtggct 
catggcatga gttattatga atagccttta 
agtaagtttt tggttacaaa ctgttcttaa 
cttggtttgg tatcaaaggt tctgatctga 
ttggaattta tccaaatctt atgtaaatgc 
attttttgag taaacttgca acagtcctaa 
gccatcccgt ctccgctcgt cacttatcct 
cggcgaccct caggtcggcc gactgcggca 
gtgacccttg tctctatttc tactatttgg 
ggctatcatc acaagagcgg aacggactca 
cgtccaagac cctcaaagat ttttggcact 
gcgccctgcc gcaaggccag ctgcttgtcc 
ggtatcttgc agttttggaa aaagatgtga 
acggggtaga agttgaggcg cgggttgggc 
ggtacgcgcg gtgagaatag gtggcgttcg 
ggcacatcgc tgcgctcttg caacgcgtcg 
ttcaacagca cgtcgtctcc cacatctagg 
acttgttcct cgtttgcctc tgcgttgtcc 
gagcggtcct cgtcgtcttc gcttacaaaa 
tcctcaagcg ggggtgcctc gacggggaag 
gcggtggtgg cgaactcaga gggggcggtt 
atctttttct gcctatagga gaaggaaatg 
acccccgagc gcggacgcgg tgcggcgcga 
ccgtccccgt cgccgccgcc tccccgggcg 
gagtccgagg acgaggaaga ctcatcacaa 
cggccatcga cctcggcggc ggatttggcc 
tctcccaagc ccgagcgccc gccatcacca 
gaagatgtgg cgctacaaat ggtgggtttc 
aaaggaggta agcgcacagt gcggcggctg 
cggacgcaag aggaagagga agagcccagc 
ccgctgagtg tgccgatcgt gtctgcgtgg 
atggacaagt accacgtgga taacgatcta 
gtggaagctc tggcggccgt atgcaagacc 
ctgaccttca ccagcaacaa gacctttgtg 
ctgcagtcgt ttgcagaggt gacctacaag 
ctgcaccgct gcgctgagat cgaaggcgag 
aataaggagc acgtgattga aatggatgtg 
gagcagtcta gcaaggccaa gatcgtgaag 
tccaacaccg acgcaaggtg ctgcgtgcac 
ggcaagtctt gcggcatgtt cttctctgaa 
atcaaggctt ttatgcaggc gctgtatcct 
atgccactac ggtgcgagtg caactcaaag 
ctaccaaagt tgactccgtt cgccctgagc 
tccgacaaga gcgtgctggc cagcgtgcac 
aaccctgtgt atcgcaactc gcgcgcgcag 
tcggcgcccg acctgctaaa cgcgttggtg 
accgagctgc cgcggatggt tgtgcctgag 
aacgtgtccc tgccagtggc gcatagcgat 



aaggggttgt ttcccaccaa ggacgacccg 180 
aaagacatat tcattctctg ctgcaaactt 240 
ggggaagttg cggttcgtgc tcgcagggct 300 
gtgcaagatt acaatctaaa caattcggag 3 60 
cagccaactt cctcttacaa gccgcatcga 420 
cttgctaaaa attatatttt taccaataag 480 
atgttaagaa atgaatcatt atcttttagt 540 
atgggaatag aaaatagaaa gagacgctca 600 
ttgaccacag gcctagaagt aaaaaaggga 660 
aggtggtggc aaccagggac ttatagggga 720 
ttaccatata caggaagata tgacttaaat 780 
aaagtgttat atagatccct cccttttcgt 840 
tgtatgttgt ctcaagaaga aaaagacgac 900 
ctaggaacag gaatgcactt ttggggaaag 960 
ggactaatag aacattattc tgcaaaaact 1020 
ttggcccaac cttgcggttc ccagggctta 1080 
aacgaggatg tgagacaagt ggtttcctga 1140 
gctctgagtg ttctattttc ctatgttctt 1200 
ttatgtaaac caagatataa aagagtgctg 1260 
cattcacctc ttgtgtgttt gtgtctgttc 1320 
tcactttcca gagggtcccc ccgcagaccc 13 80 
gctggcgccc gaacagggac cctcggataa 1440 
tgtttgtctt gtattgtctc tttcttgtct 1500 
ccatagggac caagctagcg cttctcgtcg 1560 
tcgttgagcg aggcgatatc aggtatgaca 1620 
gctcggctgc ggttggcacg gcaggatagg 1680 
taggtggcaa gcacctctgg cacggcaaat 1740 
tcgcatgtgc cgttttcttg gcgtttgggg 18 00 
taggcaaggc tgacatccgc tatggcgagg 1860 
cagataatgg cgcactggcg ctgcagatgc 1920 
tagtcgccat gcctttcgtc cccccgcccg 1980 
tggtcttgct ttttatcctc tgttggtact 2040 
cctgggtcct gctcgataat cacttcctcc 2100 
gtggtaggcg cgttggcggc atcggtggag 2160 
aggctgtcct tcttctcgac tgactccatg 2220 
gccagtcggg aagaggagca gcgcgaaacc 2280 
cgtcccccaa ccatggagga cgtgtcgtcc 2340 
cccccaaaaa agcggatgag gcggcgtatc 24 00 
gacgcgctgg tgccgcgcac acccagcccg 2460 
attgcgccca agaagaaaaa gaagcgccct 2520 
gaggtaatcg tggacagcga ggaagaaaga 2580 
agcaacccac cggtgctaat caagcatggc 2640 
aatgaagacg acccagtggc gcgtggtatg 2700 
gaagcggaaa gtgaaattac ggtgatgaac 2760 
gagaagggca tggaggctgc gcgcgcgctg 2820 
aaggcgaact tcaaactact gcctgaccaa 2880 
tggctgaacg aggagcaccg cgggttgcag 2940 
acgatgatgg ggcgattcct gcaggcgtac 3000 
catcacgagc ccacgggctg cgcgttgtgg 3060 
cttaagtgtc tacacggaag cattatgata 3120 
acgagcgaaa acgggcagcg cgcgctgaag 3180 
aaccggtggg gccgaaatgt ggtgcagatc 3240 
gacgcggcct gtccggccaa tcagttttcc 3300 
ggcgcaaagg ctcaggtggc ttttaagcag 3360 
aacgcccaga ccgggcacgg tcaccttttg 3420 
cctgggcacg cgcccttttt gggaaggcag 3480 
aacgcggagg acctggacgc ggatctgatc 3540 
cacccggcgc tgatagtgtt ccagtgctgc 3600 
ggcggaggcc ccaactgcga cttcaagata 3660 
atggtgcgca gcctgtggag tgaaaacttc 3720 
tttaagtgga gcactaaaca ccagtatcgc 3780 
gcgcggcaga acccctttga tttttaaacg 3840 
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gcgcagacgg 
cctttattga 
gtggcgctcc 
ctgtagagct 
gagttgggta 
cagttggcaa 
ccctgtacgc 
gcgctacggc 
gaaaagccaa 
atccagtgta 
tgtggagaaa 
ccaagatctt 
gttgatgttg 
acacggtttc 
acaaaacata 
ttacctcagg 
aattggacaa 
tataatgtgt 
ctgatgaatg 
aaatgccatc 
agaagagaaa 
atgctgtgtt 
ctgcactgct 
acagttataa 
ttaataacta 
aggaatattt 
gaggttttac 
aatgcaattg 
agcatcacaa 
aaactcatca 
tgtggaaagt 
tcagcaacca 
catctcaatt 
ccgcccagtt 
gccgaggccg 
ctaggctttt 
ggaagcggaa 
gctactgggc 
gtgggcttac 
tgccagctgg 
tcttgccgcc 
gatcgtttcg 
agaggctatt 
tccggctgtc 
tgaatgaact 
gcgcagctgt 
tgccggggca 
ctgatgcaat 
cgaaacatcg 
atctggacga 
gcatgcccga 
tggtggaaaa 
gctatcagga 
ctgaccgctt 
atcgccttct 
gacgcccaac 
cttcggaatc 
ggagttcttc 
gctggacgac 
cagcggctat 
tttggtcccg 
tacctacaga 



caagggtggg 
aagtgtctct 
taatctgcgc 
gttcctggtt 
ccccggtaat 
aggcgtggag 
tttgggtgga 
gcaggagtga 
agggctcaaa 
cgcccccagt 
caaagcctgg 
tgacaatggc 
cttgcttctt 
gatgacgccg 
aagaagggtg 
tcgacctcga 
actacctaca 
taaactactg 
ggagcagtgg 
tagtgatgat 
ggtagaagac 
tagtaataga 
atacaagaaa 
tcataacata 
tgctcaaaaa 
gatgtatagt 
ttgctttaaa 
ttgttgttaa 
atttcacaaa 
atgtatctta 
ccccaggctc 
ggtgtggaaa 
agtcagcaac 
ccgcccattc 
cctcggcctc 
gcaaaaagct 
cacgtagaaa 
tatctggaca 
atggcgatag 
ggcgccctct 
aaggatctga 
catgattgaa 
cggctatgac 
agcgcagggg 
gcaggacgag 
gctcgacgtt 
ggatctcctg 
gcggcggctg 
catcgagcga 
agagcatcag 
cggcgaggat 
tggccgcttt 
catagcgttg 
cctcgtgctt 
tgacgagttc 
ctgccatcac 
gttttccggg 
gcccaccccg 
ctcgcggagt 
ccgcgcatcc 
gatctttgtg 
gatttaaagc 



ggtaaataat 
agtacattat 
actgtggctg 
gcgacgcagg 
aaggttcatg 
aaacatgcag 
cttttccagc 
ctcgtactca 
gaggtagcat 
ctcgcgaccg 
aaagcgcttg 
tttcagttcc 
tatgttgtgg 

cggtgcggcc 

ggctcgtcca 
gggatctttg 
gagatttaaa 
attctaattg 
tggaatgcct 
gaggctactg 
cccaaggact 
actcttgctt 
attatggaaa 
ctgttttttc 
ttgtgtacct 
gccttgacta 
aaacctccca 
cttgtttatt 
taaagcattt 
tcatgtctgg 
cccagcaggc 
gtccccaggc 
catagtcccg 
tccgccccat 
tgagctattc 
tcacgctgcc 
gccagtccgc 
agggaaaacg 
ctagactggg 
ggtaaggttg 
tggcgcaggg 
caagatggat 
tgggcacaac 
cgcccggttc 
gcagcgcggc 
gtcactgaag 
tcatctcacc 
catacgcttg 
gcacgtactc 
gggctcgcgc 
ctcgtcgtga 
tctggattca 
gctacccgtg 
tacggtatcg 
ttctgagcgg 
gagatttcga 
acgccggctg 
ggctcgatcc 
tctaccggca 
atgcccccga 
aaggaacctt 
tctaaggtaa 



cacccgagag 
ttttacatgt 
cggaagtagg 
gtgggctgta 
gtggggttgt 
cagaatagtc 
gttatacagc 
aactggtaaa 
gtttttgagt 
gccgtattga 
tcataggtgc 
tgctcactgg 
cgttgccggc 
ggtgcacacg 
tgggatccat 
tgaaggaacc 
gctctaaggt 
tttgtgtatt 
ttaatgagga 
ctgactctca 
ttccttcaga 
gctttgctat 
aatattctgt 
ttactccaca 
ttagcttttt 
gagatcataa 
cacctccccc 
gcagcttata 
ttttcactgc 
atccggctgt 
agaagtatgc 
tccccagcag 
cccctaactc 
ggctgactaa 
cagaagtagt 
gcaagcactc 
agaaacggtg 
caagcgcaaa 
cggttttatg 
ggaagccctg 
gatcaagatc 
tgcacgcagg 
agacaatcgg 
tttttgtcaa 
tatcgtggct 
cgggaaggga 
ttgctcctgc 
atccggctac 
ggatggaagc 
cagccgaact 
cccatggcga 
tcgactgtgg 
atattgctga 
ccgctcccga 
gactctgggg 
ttccaccgcc 
gatgatcctc 
cctcgcgagt 
gtgcaaatcc 
actgcaggag 
acttctgtgg 
atataaaatt 



tgtacaaata 
ttttcaagtg 
gcgagtggcg 
cctggggact 
gatccatggg 
cacaggcggc 
ggtcggggga 
cctgcttgag 
gcgggttcca 
ctatggcgca 
ccaaaaaata 
agcccatggc 
cgagaagggc 
gaccacgtca 
atatagggcc 
ttacttctgt 
aaatataaaa 
ttagattcca 
aaacctgttt 
acattctact 
attgctaagt 
ttacaccaca 
aacctttata 
caggcataga 
aatttgtaaa 
tcagccatac 
tgaacctgaa 
atggttacaa 
attctagttg 
ggaatgtgtg 
aaagcatgca 
gcagaagtat 
cgcccatccc 
ttttttttat 
gaggaggctt 
agggcgcaag 
ctgaccccgg 
gagaaagcag 
gacagcaagc 
caaagtaaac 
tgatcaagag 
ttctccggcc 
ctgctctgat 
gaccgacctg 
ggccacgacg 
ctggctgcta 
cgagaaagta 
ctgcccattc 
cggtcttgtc 
gttcgccagg 
tgcctgcttg 
ccggctgggt 
agagcttggc 
ttcgcagcgc 
ttcgaaatga 
gccttctatg 
cagcgcgggg 
tggttcagct 
gtcggcatcc 
tggggaggca 
tgtgacataa 
tttaagtgta 



aaagcatttg 
acaaaaagaa 
ctccaggaag 
gttgagcatg 
agtttggggc 
cgagttgggc 
agaagcaatg 
tcgctggtca 
ggcaaaggcc 
ggcgagcttg 
tggcccacaa 
ggcagctgtt 
gtgcgcaggt 
aagacttcaa 
cgggttataa 
ggtgtgacat 
tttttaagtg 
acctatggaa 
tgctcagaag 
cctccaaaaa 
tttttgagtc 
aaggaaaaag 
agtaggcata 
gtgtctgcta 

ggggttaata 

cacatttgta 
acataaaatg 
ataaagcaat 
tggtttgtcc 
tcagttaggg 
tctcaattag 
gcaaagcatg 
gcccctaact 
ttatgcagag 
ttttggaggc 
ggctgctaaa 
atgaatgtca 
gtagcttgca 
gaaccggaat 
tggatggctt 
acaggatgag 
gcttgggtgg 
gccgccgtgt 
tccggtgccc 
ggcgttcctt 
ttgggcgaag 
tccatcatgg 
gaccaccaag 
gatcaggatg 
ctcaaggcgc 
ccgaatatca 
gtggcggacc 
ggcgaatggg 
atcgccttct 
ccgaccaagc 
aaaggttggg 
atctcatgct 
gctgcctgag 
aggaaaccag 
cgatggccgc 
ttggacaaac 
taatgtgtta 



3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
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aactactgat tctaattgtt tgtgtatttt agattccaac ctatggaact gatgaatggg 762 0 
agcagtggtg gaatgccttt aatgaggaaa acctgttttg ctcagaagaa atgccatcta 7680 
gtgatgatga ggctactgct gactctcaac attctactcc tccaaaaaag aagagaaagg 774 0 
tagaagaccc caaggacttt ccttcagaat tgctaagttt tttgagtcat gctgtgttta 7800 
gtaatagaac tcttgcttgc tttgctattt acaccacaaa ggaaaaagct gcactgctat 7860 
acaagaaaat tatggaaaaa tattctgtaa cctttataag taggcataac agttataatc 7920 
ataacatact gttttttctt actccacaca ggcatagagt gtctgctatt aataactatg 7980 
ctcaaaaatt gtgtaccttt agctttttaa tttgtaaagg ggttaataag gaatatttga 8040 
tgtatagtgc cttgactaga gatcataatc agccatacca catttgtaga ggttttactt 8100 
gctttaaaaa acctcccaca cctccccctg aacctgaaac ataaaatgaa tgcaattgtt 8160 
gttgttaact tgtttattgc agcttataat ggttacaaat aaagcaatag catcacaaat 8220 
ttcacaaata aagcattttt ttcactgcat tctagttgtg gtttgtccaa actcatcaat 8280 
gtatcttatc atgtctggat ccccaggaag ctcctctgtg tcctcataaa ccctaacctc 8340 
ctctacttga gaggacattc caatcatagg ctgcccatcc accctctgtg tcctcctgtt 8400 
aattaggtca cttaacaaaa aggaaattgg gtaggggttt ttcacagacc gctttctaag 84 60 
ggtaatttta aaatatctgg gaagtccctt ccactgctgt gttccagaag tgttggtaaa 8520 
cagcccacaa atgtcaacag cagaaacata caagctgtca gctttgcaca agggcccaac 85 80 
accctgctca tcaagaagca ctgtggttgc tgtgttagta atgtgcaaaa caggaggcac 8640 
attttcccca cctgtgtagg ttccaaaata tctagtgttt tcatttttac ttggatcagg 87 00 
aacccagcac tccactggat aagcattatc cttatccaaa acagccttgt ggtcagtgtt 8760 
catctgctga ctgtcaactg tagcattttt tggggttaca gtttgagcag gatatttggt 8820 
cctgtagttt gctaacacac cctgcagctc caaaggttcc ccaccaacag caaaaaaatg 8880 
aaaatttgac ccttgaatgg gttttccagc accattttca tgagtttttt gtgtccctga. 8940 
atgcaagttt aacatagcag ttaccccaat aacctcagtt ttaacagtaa cagcttccca 9000 
catcaaaata tttccacagg ttaagtcctc atttaaatta ggcaaaggaa ttcttgaaga 9060 
cgaaagggcc tcgtgatacg cctattttta taggttaatg tcatgataat aatggtttct 9120 
tagacgtcag gtggcacttt tcggggaaat gtgcgcggaa cccctatttg tttatttttc 9180 
taaatacatt caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa 924 0 
tattgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt 9300 
gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct 9360 
gaagatcagt tgggtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc 942 0 
cttgagagtt ttcgccccga agaacgtttt ccaatgatga gcacttttaa agttctgcta 94 80 
tgtggcgcgg tattatcccg tgttgacgcc gggcaagagc aactcggtcg ccgcatacac 9540 
tattctcaga atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc 9600 
atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac 9660 
ttacttctga caacgatcgg aggaccgaag gagctaaccg cttttttgca caacatgggg 9720 
gatcatgtaa ctcgccttga tcgttgggaa ccggagctga atgaagccat accaaacgac 9780 
gagcgtgaca ccacgatgcc tgcagcaatg gcaacaacgt tgcgcaaact attaactggc 9840 
gaactactta ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt 9900 
gcaggaccac ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga 9960 
gccggtgagc gtgggtctcg cggtatcatt gcagcactgg ggccagatgg taagccctcc 10020 
cgtatcgtag ttatctacac gacggggagt caggcaacta tggatgaacg aaatagacag 100 80 
atcgctgaga taggtgcctc actgattaag cattggtaac tgtcagacca agtttactca 10140 
tatatacttt agattgattt aaaacttcat ttttaattta aaaggatcta ggtgaagatc 10200 
ctttttgata atctcatgac caaaatccct taacgtgagt tttcgttcca ctgagcgtca 10260 
gaccccgtag aaaagatcaa aggatcttct tgagatcctt tttttctgcg cgtaatctgc 10320 
tgcttgcaaa caaaaaaacc accgctacca gcggtggttt gtttgccgga tcaagagcta 103 80 
ccaactcttt ttccgaaggt aactggcttc agcagagcgc agataccaaa tactgtcctt 1044 0 
ctagtgtagc cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc 10500 
gctctgctaa tcctgttacc agtggctgct gccagtggcg ataagtcgtg tcttaccggg 10560 
ttggactcaa gacgatagtt accggataag gcgcagcggt cgggctgaac ggggggttcg 10620 
tgcacacagc ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag 10680 
ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc 10740 
agggtcggaa caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat 10800 
agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg 10860 
gggcggagcc tatggaaaaa cgccagcaac gcggcctttt tacggttcct ggccttttgc 10920 
tggccttttg ctcacatgtt ctttcctgcg ttatcccctg attctgtgga taaccgtatt 10980 
accgcctttg agtgagctga taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca 11040 
gtgagcgagg aagcggaaga gcgcctgatg cggtattttc tccttacgca tctgtgcggt 11100 
atttcacacc gcatatggtg cactctcagt acaatctgct ctgatgccgc atagttaagc 11160 
cagtatctgc tccctgcttg tgtgttggag gtcgctgagt agtgcgcgag caaaatttaa 11220 
gctacaacaa ggcaaggctt gaccgacaat tgcatgaaga atctgcttag ggttaggcgt 11280 
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tttgcgctgc ttcgcgatgt acgggccaga tatacgcgta tctgagggga ctagggtgtg 11340 
tttaggcgaa aagcggggct tcggttgtac gcggttagga gtcccctcag gatatagtag 114 00 
tttcgctttt gcatagggag ggggaaatgt agtcttatgc aatacacttg tagtcttgca 11460 
acatggtaac gatgagttag caacatgcct tacaaggaga gaaaaagcac cgtgcatgcc 11520 
gattggtgga agtaaggtgg tacgatcgtg ccttattagg aaggcaacag acgggtctga 11580 
catggattgg acgaaccact , 11600 

<210> 36 
<211> 53 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 36 

gtcactcgag gactcggtcg actgaaaatg agacatatta tctgccacgg acc 53 



<210> 37 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 37 

cgagatcgat cacctccggt acaaggtttg gcatag 3 6 

<210> 38 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 38 

catgaagatc tggaaggtgc tgaggtacga tgagacc 37 

<210> 39 
<211> 51 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 39 

gcgacttaag cagtcagctg agacagcaag acacttgctt gatccaaatc c 51 

<210> 40 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<400> 40 

cacgaattcg tcagcgcttc tcgtcgcgtc caagac.cc 



38 
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<210> 41 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<400> 41 

caccccgggg aggcggcggc gacggggacg gg 

<210> 42 
<211> 7231 
<212> DNA 

<213> Artificial Sequence 



32 



<220> 

<223> Description of Artificial Sequence: plasmid pDV80 
<400> 42 

ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg cgagcaaaat ttaagctaca 60 
acaaggcaag gcttgaccga caattgcatg aagaatctgc ttagggttag gcgttttgcg 120 
ctgcttcgcg atgtacgggc cagatatacg cgttgacatt gattattgac tagttattaa 180 
tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg cgttacataa 240 
cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt gacgtcaata 300 
atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca atgggtggac 3 60 
tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc aagtacgccc 420 
cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta catgacctta 480 
tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac catggtgatg 540 
cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg atttccaagt 6 00 
ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg ggactttcca 660 
aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt acggtgggag 720 
gtctatataa gcagagctct ctggctaact agagaaccca ctgcttactg gcttatcgaa 780 
attaatacga ctcactatag ggagacccaa gctggctagc gtttaaactt aagcttggta 840 
ccgagctcgg atccactctc ttccgcatcg ctgtctgcga gggccagctg ttggggtgag 9 00 
tactccctct gaaaagcggg catgacttct gcgctaagat tgtcagtttc caaaaacgag 960 
gaggatttga tattcacctg gcccgcggtg atgcctttga gggtggccgc atccatctgg 1020 
tcagaaaaga caatcttttt gttgtcaagc ttggtggcaa acgacccgta gagggcgttg 1080 
gacagcaact tggcgatgga gcgcagggtt tggtttttgt cgcgatcggc gcgctccttg 1140 
gccgcgatgt ttagctgcac gtattcgcgc gcaacgcacc gccattcggg aaagacggtg 1200 
gtgcgctcgt cgggcaccag gtgcacgcgc caaccgcggt tgtgcagggt gacaaggtca 1260 
acgctggtgg ctacctctcc gcgtaggcgc tcgttggtcc agcagaggcg gccgcccttg 1320 
cgcgagcaga atggcggtag ggggtctagc tgcgtctcgt ccggggggtc tgcgtccacg 1380 
gtaaagaccc cgggcagcag gcgcgcgtcg aagtagtcta tcttgcatcc ttgcaagtct 1440 
agcgcctgct gccatgcgcg ggcggcaagc gcgcgctcgt atgggttgag tgggggaccc 1500 
catggcatgg ggtgggtgag cgcggaggcg tacatgccgc aaatgtcgta aacgtagagg 1560 
ggctctctga gtattccaag atatgtaggg tagcatcttc caccgcggat gctggcgcgc 1620 
acgtaatcgt atagttcgtg cgagggagcg aggaggtcgg gaccgaggtt gctacgggcg 1680 
ggctgctctg ctcggaagac tatctgcctg aagatggcat gtgagttgga tgatatggtt 1740 
ggacgctgga agacgttgaa gctggcgtct gtgagaccta ccgcgtcacg cacgaaggag 1800 
gcgtaggagt cgcgcagctt gttgaccagc tcggcggtga cctgcacgtc tagggcgcag 1860 
tagtccaggg tttccttgat gatgtcatac ttatcctgtc cctttttttt ccacagctcg 1920 
cggttgagga caaactcttc gcggtctttc cagtactctt ggatcggaaa cccgtcggcc 1980 
tccgaacgag atccgtactc cgccgccgag ggacctgagc gagtccgcat cgaccggatc 2040 
ggaaaacctc tcgagaaagg cgtctaacca gtcacagtcg caagatccaa gatgaagcgc 2100 
gcccgcccca gcgaagatga cttcaacccc gtctacccct atggctacgc gcggaatcag 2160 
aatatcccct tcctcactcc cccctttgtc tcctccgatg gattcaaaaa cttcccccct 2220 
ggggtactgt cactcaaact ggctgatcca atcaccatta ccaatgggga tgtatccctc 2280 
aaggtgggag gtggtctcac tttgcaagat ggaagcctaa ctgtaaaccc taaggctcca 2340 
ctgcaagtta atactgataa aaaacttgag cttgcatatg ataatccatt tgaaagtagt 2400 
gctaataaac ttagtttaaa agtaggacat ggattaaaag tattagatga aaaaagtgct 2460 



WO 01/83729 



46 



PCT/EP01/04863 



gcggggttaa aagatttaat tggcaaactt 
gaaaatttag aaaatacaga tggtagcagc 
gaagggttga catttgacaa tgatggatac 
cgcacacttt ggacaacacc agacacatct 
tctaaactca ctttggtact tacaaagtgt 
attgtggtcg caggaaagta ccacatcata 
tttactatta aactgctatt taataagaac 
aaagcttatt ggaactttag aagtggaaat 
attggtttta tgcctaattt ggtagcgtat 
agagacatag tttatggaac tatatatctt 
aaaactacct ttaaccaaga aactggatgt 
tccaaaacct atgaaaatgt tgaatttgaa 
caagaatgaa agagcggccg ctcgagtcta 
cgactgtgcc ttctagttgc cagccatctg 
ccctggaagg tgccactccc actgtccttt 
gtctgagtag gtgtcattct attctggggg 
attgggaaga caatagcagg catgctgggg 
aaagaaccag ctggggctct agggggtatc 
cggcgggtgt ggtggttacg cgcagcgtga 
ctcctttcgc tttcttccct tcctttctcg 
taaatcgggg catcccttta gggttccgat 
aacttgatta gggtgatggt tcacgtagtg 
ctttgacgtt ggagtccacg ttctttaata 
tcaaccctat ctcggtctat tcttttgatt 
ggttaaaaaa tgagctgatt taacaaaaat 
tcagttaggg tgtggaaagt ccccaggctc 
atctcaatta gtcagcaacc aggtgtggaa 
tgcaaagcat gcatctcaat tagtcagcaa 
cgcccctaac tccgcccagt tccgcccatt 
tttatgcaga ggccgaggcc gcctctgcct 
tttttggagg cctaggcttt tgcaaaaagc 
ctgatcagca cgtgttgaca attaatcatc 
caaggtgagg aactaaacca tggccaagtt 
cgacgtcgcc ggagcggtcg agttctggac 
ggaggacgac ttcgccggtg tggtccggga 
ggaccaggtg gtgccggaca acaccctggc 
gtacgccgag tggtcggagg tcgtgtccac 
gaccgagatc ggcgagcagc cgtgggggcg 
ctgcgtgcac ttcgtggccg aggagcagga 
cgccgccttc tatgaaaggt tgggcttcgg 
cctccagcgc ggggatctca tgctggagtt 
ttataatggt tacaaataaa gcaatagcat 
actgcattct agttgtggtt tgtccaaact 
gtcgacctct agctagagct tggcgtaatc 
ttatccgctc acaattccac acaacatacg 
tgcctaatga gtgagctaac tcacattaat 
gggaaacctg tcgtgccagc tgcattaatg 
gcgtattggg cgctcttccg cttcctcgct 
gcggcgagcg gtatcagctc actcaaaggc 
taacgcagga aagaacatgt gagcaaaagg 
cgcgttgctg gcgtttttcc ataggctccg 
ctcaagtcag aggtggcgaa acccgacagg 
aagctccctc gtgcgctctc ctgttccgac 
tctcccttcg ggaagcgtgg cgctttctca 
gtaggtcgtt cgctccaagc tgggctgtgt 
cgccttatcc ggtaactatc gtcttgagtc 
ggcagcagcc actggtaaca ggattagcag 
cttgaagtgg tggcctaact acggctacac 
gctgaagcca gttaccttcg gaaaaagagt 
cgctggtagc ggtggttttt ttgtttgcaa 
tcaagaagat cctttgatct tttctacggg 
ttaagggatt ttggtcatga gattatcaaa 



gtggttttaa caggaaaagg aataggcact 2520 
agaggaattg gtataaatgt aagagcaaga 25 80 
ttggtagcat ggaacccaaa gtatgacacg 2640 
ccaaactgca caattgctca agataaggac 2700 
ggaagtcaaa tattagctaa tgtgtctttg 2760 
aataataaga caaatccaaa aataaaaagt 2 820 
ggagtgcttt tagacaactc aaatcttgga 28 80 
tccaatgttt cgacagctta tgaaaaagca 294 0 
ccaaaaccca gtaattctaa aaaatatgca 3 0 00 
ggtggaaaac ctgatcagcc agcagtcatt 3 060 
gaatactcta tcacatttaa ctttagttgg 3120 
accacctctt ttaccttctc ctatattgcc 3180 
gagggcccgt ttaaacccgc tgatcagcct 3240 
ttgtttgccc ctcccccgtg ccttccttga 33 00 
cctaataaaa tgaggaaatt gcatcgcatt 3360 
gtggggtggg gcaggacagc aagggggagg 3420 
atgcggtggg ctctatggct tctgaggcgg 3480 
cccacgcgcc ctgtagcggc gcattaagcg 3540 
ccgctacact tgccagcgcc ctagcgcccg 3600 
ccacgttcgc cggctttccc cgtcaagctc 3660 
ttagtgcttt acggcacctc gaccccaaaa 3720 
ggccatcgcc ctgatagacg gtttttcgcc 3780 
gtggactctt gttccaaact ggaacaacac 3 840 
tataagggat tttggggatt tcggcctatt 3 900 
ttaacgcgaa ttaattctgt ggaatgtgtg 3960 
cccaggcagg cagaagtatg caaagcatgc 4 020 
agtccccagg ctccccagca ggcagaagta 4080 
ccatagtccc gcccctaact ccgcccatcc 4140 
ctccgcccca tggctgacta atttttttta 4200 
ctgagctatt ccagaagtag tgaggaggct 4260 
tcccgggagc ttgtatatcc attttcggat 4320 
ggcatagtat atcggcatag tataatacga 43 80 
gaccagtgcc gttccggtgc tcaccgcgcg 4440 
cgaccggctc gggttctccc gggacttcgt 4500 
cgacgtgacc ctgttcatca gcgcggtcca 4560 
ctgggtgtgg gtgcgcggcc tggacgagct 462 0 
gaacttccgg gacgcctccg ggccggccat 4 680 
ggagttcgcc ctgcgcgacc cggccggcaa 4740 
ctgacacgtg ctacgagatt tcgattccac 4800 
aatcgttttc cgggacgccg gctggatgat 4860 
cttcgcccac cccaacttgt ttattgcagc 4920 
cacaaatttc acaaataaag catttttttc 4980 
catcaatgta tcttatcatg tctgtatacc 5040 
atggtcatag ctgtttcctg tgtgaaattg 5100 
agccggaagc ataaagtgta aagcctgggg 5160 
tgcgttgcgc tcactgcccg ctttccagtc 5220 
aatcggccaa cgcgcgggga gaggcggttt 5280 
cactgactcg ctgcgctcgg tcgttcggct 5340 
ggtaatacgg ttatccacag aatcagggga 5400 
ccagcaaaag gccaggaacc gtaaaaaggc 5460 
cccccctgac gagcatcaca aaaatcgacg 5520 
actataaaga taccaggcgt ttccccctgg 5580 
cctgccgctt accggatacc tgtccgcctt 5640 
atgctcacgc tgtaggtatc tcagttcggt 5700 
gcacgaaccc cccgttcagc ccgaccgctg 5760 
caacccggta agacacgact tatcgccact 5820 
agcgaggtat gtaggcggtg ctacagagtt 5880 
tagaaggaca gtatttggta tctgcgctct 5940 
tggtagctct tgatccggca aacaaaccac 6000 
gcagcagatt acgcgcagaa aaaaaggatc 6060 
gtctgacgct cagtggaacg aaaactcacg 6120 
aaggatcttc acctagatcc ttttaaatta 6180 
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aaaatgaagt tttaaatcaa tctaaagtat atatgagtaa acttggtctg acagttacca 6240 
atgcttaatc agtgaggcac ctatctcagc gatctgtcta tttcgttcat ccatagttgc 63 00 
ctgactcccc gtcgtgtaga taactacgat acgggagggc ttaccatctg gccccagtgc 6360 
tgcaatgata ccgcgagacc cacgctcacc ggctccagat ttatcagcaa taaaccagcc 6420 
agccggaagg gccgagcgca gaagtggtcc tgcaacttta tccgcctcca tccagtctat 6480 
taattgttgc cgggaagcta gagtaagtag ttcgccagtt aatagtttgc gcaacgttgt 6540 
tgccattgct acaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt cattcagctc 6600 
cggttcccaa cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa aagcggttag 6660 
ctccttcggt cctccgatcg ttgtcagaag taagttggcc gcagtgttat cactcatggt 6720 
tatggcagca ctgcataatt ctcttactgt catgccatcc gtaagatgct tttctgtgac 6780 
tggtgagtac tcaaccaagt cattctgaga atagtgtatg cggcgaccga gttgctcttg 6840 
cccggcgtca atacgggata ataccgcgcc acatagcaga actttaaaag tgctcatcat 6900 
tggaaaacgt tcttcggggc gaaaactctc aaggatctta ccgctgttga gatccagttc 6960 
gatgtaaccc actcgtgcac ccaactgatc ttcagcatct tttactttca ccagcgtttc 7020 
tgggtgagca aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg cgacacggaa 7 080 
atgttgaata ctcatactct tcctttttca atattattga agcatttatc agggttattg 7140 
tctcatgagc ggatacatat ttgaatgtat ttagaaaaat aaacaaatag gggttccgcg 7200 
cacatttccc cgaaaagtgc cacctgacgt c 7231 

<210> 43 
<211> 48 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 43 

tgtcttggat ccaagatgaa gcgcgcccgc cccagcgaag atgacttc 4 8 



<210> 44 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 44 

aaacacggcg gccgctcttt cattcttg 28 

<210> 45 
<211> 17 
<212> PRT 

<213> Ad 37 N-terminus 
<400> 45 

Met Ser Lys Arg Leu Arb Val Glu Asp Asp Phe Asn Pro Val Tyr Pro Tyr 
1 " 5 10 15 

<210> 46 
<211> 6 
<212> PRT 

<213> artificial sequence 
<400> 46 

Lys Arg Ala Arg Pro Ser 
1 5 



<210> 47 
<211> 17 
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<212> PRT 

<213> Ad5 modified N- terminus 
<400> 47 

Met Lys Arg Ala Arg Pro Ser Glu Asp Thr Phe Asn Pro Val Tyr Pro Tyr 
15 10 15 



<210> 48 
<211> 24 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 48 

ggatccatgg gatacttggt agca 24 

<210> 49 
<211> 30 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<400> 49 

gcaactcgag tcattcttgg gcaatatagg 

<210> 50 
<211> 26 
<212> DNA 

<213> Artificial Sequence 



30 



<220> 

<223> Description of Artificial Sequence; primer 
<400> 50 

cgcgctgact cttaaggact agtttc 26 

<210> 51 
<211> 37 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<400> 51 

gcgcttaatt aacatcatca ataatatacc ttatttt 



37 



